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GEm EXPRESSION TECHNIQUE 
FIELD OF THE INVENTION 

5 The present application relates to gene expression techniques. 
BACKGROUND OF THE INVENTION 

The class of proteins known as ohaperones have been defined by Hartl 
10 (1996, Nature, 381, 571-580) as a protein that binds to and stabilises an 
otherwise unstable conformer of another protein and, by controlled binding 
and release, facilitates its correct fate in vivo, be it folding, oligoroeric 
assembly, transport to a particular subcellular compartment, or disposal by 
degradation. 

15 

BiP (also known as GRP7S, Ig heavy chain binding protein and Kax2p in 
yeast) is an abundant ~70kDa chaperon e of the lisp 70 family, resident in 
the endoplasmic reticulum (ER), which amongst other functions, serves to 
assist in transport in the secretory system and fold proteins. 

20 

Protein disulphide isomerase (PDI) is a chaperone protein, resident in the 
ER that is involved in the catalysis of disulphide bond formation during the 
post-trantslational processing of proteins- 

25 Studies of the secretion of both native and foreign proteins have been 
shown that transit from the ER to the Golgi is the rate-limiting step. 
Evidence points to a transient association of the BiP with normal proteins 
and a more stable interaction with mutant or rais folded forms of a protei-n. 
As a result, BiP may play a dual role in solubilising folding precursors and 

1 
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preventing the transport of unfolded and unassembled proteins. Robinson ^ 
and Wittrup, 1995, Biotechnol. Prog. 11, 171-177, have examined the effect ^ 
of foreign protein secretion on BiP (Kar2 P ) and PDI protein levels in 
SaccharomycB* cerev&ae and found that prolonged constitutive expression " 
of foreign secreted proteins reduces soluble BiP and PDI to levels 
undetectable by Western analysis. The lowering of ER chaperone and 
foldase levels as a consequence of heterologous protein secretion bas 
important implications for attempts to improve yeast expression/secretion 
systems. 

Expression of chaperones is regulated by a number of mechanisms, 
including the unfolded protein response (UPR). 

Using recombinant techniques, multiple PDI gene copies has been shown to 

increase PDI protein levels in a host cell (Farquhar et al, 1991, Gene, 108, 
81-89). 

Co-expression of the gene encoding PDI and a gene encoding a 
heterologous disulphide-bonded protein was first suggested in WO 
93/25676, published on 23 December 1993, as a means of increasing the 
production of the heterologous protein. WO 93/25676 reports that the 
recombinant expression of antistasm and tick anticoagulant protein can be 
increased by co-expression with PDL 

25 This strategy has been exploited to increase the recombinant expression of 
other types of protein. 

Robinson et al 7 1994, Bio/Technology, 12, 381-3S4 reported that a 
recombinant additional PDI gene copy in Saccharomyces cerevisiae could 
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be used to increase the recombinant expression of human platelet derived 
growth factor (PDGF) B homodimer by ten-fold and Schizosachararnyces 
pombe acid phosphatase by four-fold. 

5 Hayano et al, 1995, FEBS Letters, 377, 505-511 described the co- 
esxpression of human lysozyme and PDX in yeast Increases of around 30- 
60% in functional lysozyme production and secretion were observed. 

Shusta et al, 1998, Nature Biotechnology, 16, 773-777 reported that the 
10 recombinant expression of single-chain antibody fragments (scFv) in 
Saccharomyces cerevisiae could be increased by between 2-8 fold by over- 
expressing PDI in the host cell. 

Bao & Fukuhara, 2001, Gene, 272, 103-110 reported that the expression 
15 . and secretion of recombinant human serum albumin (rHSA) in the yeast 
Kluyveromyces lactis could be increased by 15-fold or more by co- 
expression with an additional recombinant .copy of the yeast PDX. ..gene 
(KIPDI11 

20 In order to produce co- transformed yeast comprising both a PDI gene and a 
gene for a heterologous protein, WO 93/25676 taught that the two genes 
could be chromosomally integrated; one could be chromosomally integrated 
and one present on a plasmid; each gene could be introduced on a different 
plasmid; or both genes could be introduced on the same plasmid. WO 
25 93/25676 exemplified expression of antistasin from the plasmid pKH4a2 in 
yeast strains having a chromosomally integrated additional copy of a PDI 
gene (Examples 16 and 17); expression of antistasm from the vector K991 
with an additional PDI gene copy being present on a multicopy yeast shuttle 
vector named YEp24 (Botstein et al, 1979, Gene, 8, 1 7-24) (Example 20); 

3 
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and expression of both the coitistasin and ihe PDI g^es from the yeast- 
shuttle vector pCl/1 (Rosenberg et al, 1984, Nature, 312, 77-80) under 
control of the GAL10 and GAL I promoters, respectively. Indeed, 
Robinson and Wittrup, 1595, op. cit., also used the GAL1 - GAL10 
intergenic region to express erythropoietin and concluded that production 
yeast strains for the secretion of heterologous proteins should be 
constructed using tightly repressive, inducible promoters, otherwise the 
negative effects of sustained secretion (Le. lowered detectable BiP and PDI) 
would be dominant after the many generations of cell growth required to fill 
a large-scale fermenter. 

Subsequent work-in the field has identified chromosomal integration of 
transgenes as the key to maximising recombinant protein production. 

Robinson et al, 1994, op. cit., obtained the observed increases in expression 

of PDGF and S. pombe acid phosphatase using an additional 

chromosomally integrated PDI .gene copy. Robinson et al reported that 

attempts to use the multi-copy 2um expression vector to increase PDI 

protein levels had had a detrimental effect on heterologous protein 
secretion. 



Hayano et al, 1995, op. cit. described the introduction of genes for human 
lysozyme and PDI into a yeast host each on a separate linearised integration 
vector, thereby to bring about chromosomal integration. 

m 

Shusta et al, 1998, op. cit, reported that in yeast systems, the choice 
between integration of a transgene into the host chromosome versus the use 
of episomal expression vectors can greatly affect secretion and, with 
reference to Parekh & WIttrup, 1997, Biotechnol Prog,, 13, 117-122, that 
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stable integration of the scFv gene into the host chromosome using a 5 
integration vector was superior to the use of a 2fim-based expression 
plasmid. Parekh & Witirup, op. cit, had previously taught that the 
expression of bovine pancreatic trypsin inhibitor (BPTI) was increased by 
5 an order of magnitude using a 5 integration vector rather than a JjJtm-based 
expression plasmid. The 2|jm-based expression plasraid was said to be 
counter-productive for the production of heterologous secreted protein. 

Bao & Fufeuhara, 2001, op. cit, reported that d6 It was first thought that the 
10 KIPDI1 gene might be directly introduced into the multi-copy vector that 
carried the rHSA expression cassette. However, such constructs were found 
to severely affect yeast growth and plasmid stability. This confirmed our 
previous finding that the KWDI1 gene on a multi-copy vector was 
detrimental to growth of K. laciis cells (Bao et ah 2000)". Bao et al, 20Q0, 
15 Yeast, 16, 329-341, as referred to in the above-quoted passage of Bao & 
Fukuhata, reported that the KIPDI1 gene had been introduced into K. laciis 
an a multi-copy plasmid, pKan707 ? and that tire presence of the plasmid 
caused the strain to grow poorly. Bao et al concluded that over-expression 
of the JOPDU gene was toxic to K. laciis cells. In the light of the earlier 
20 findings in Bao et aU Bao & Fukuhara chose to introduce a single 
duplication of KIPD1 1 on tihe host chromosome. 

Against this background, we have surprisingly demonstrated that, contrary 
to the suggestions in the prior art, when the genes for a.chaperone protein 
25 and a heterologous protein axe co-expressed on a 2pin-family multi-copy 
plasmid in yeast, the production of the heterologous protein is substantially 

increased. 



5 
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DESCRIPTION OF THE INVENTION 



A first aspect of the present invention provides a method for producing 
heterologous protein comprising: 



(a) providing a host cell comprising a Sjam-family plasmid, the 
plasmid comprising a gene encoding a protein comprising the 
sequence of a chaperone protein and a gene encoding a heterologous 
protein; 

(b) culturing the host cell in a culture medium under conditions 
that allow the expression of the gene encoding the chaperone protein 
and the gene encoding a heterologous protein; 



15 (c) purifying the thus expressed heterologous protein from the 

culture medium; and 



(d) optionally, lyophilising the thus purified protein, 

20 In one embodiment, step (c) purifies the thus expressed heterologous 
protein to a pharmaceutical^ acceptable level of purity. 



Preferably, the method further comprises the step of formulating the 
purified heterologous protein with a carrier or diluent, such as a 
25 pharmaceutical^ acceptable carrier or diluent and optionally presenting the 
thus formulated protein in a unit dosage fomu 

A second aspect of the present invention provides for the use of a 2fim- 
family plasmid as an expression vector to increase the production of a 

6 
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fungal (preferably yeast) or vertebrate heterologous protein by providing a 
gene encoding the heterologous protein and a gene encoding a protein 
comprising the sequence of a chaperone protein on the same 2fxm-family 
plasmid. 

5 

A third aspect of die present invention provides a 2 ^m- family plasmid 
comprising a gene encoding a protein comprising the sequence of a 
chaperone protein and a gene encoding a heterologous protein, wherein if 
the plasmid is based on the 2pm plasmid then It is a disintegration vector. 

10 

A fourth aspect of the invention provides a host cell comprising a plasmid 
as defined above. 

The present invention relates to recombinantly modified versions of 2jnn- 
15 family plasmids. 

Certain closely related species of budding yeast have been shown to contain 
naturally occurring circular double stranded DNA plasmids- These 
plasmidss collectively termed 2pm-family plasmids, include pSRl 3 pSB3 

20 and pSB4 from Zygosa cck aromyces rouxii (formerly classified as 
Zygosaccharomyces bisporus\ plasmids pSBl and pSB2 from 
Zygosaccharomyces haiiii, plasmid pSMl from Zygosaccharomyces 
fermentaii, plasmid pKDt from Kluyveromyces drosphilamm* an un-naraed 
plasmid from Plchia memhranaefaciens (hereinafter "pPMl") and the 2pm 

25 plasmid and variants (such as Scpl, Scp2 and Scp3) from Saccharomyces 
cerevisiae (Volkert, et aL* 1989, Microbiological Reviews, S3 ? 299; Murray 
&i aL 7 1988, X MoL BioL 200, 601; Painting, et aL, 1984, J. Applied 
Bacteriology, 56, 331). 

7 
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As a family of plasmids these molecules share a series of common features 
in that they typically possess two Inverted repeats on opposite sides of the 
plasmid, have a similar size around 6-kbp (range 4757 to 6615-bp), three 
open reading frames, one of which encodes for a site specific iecombinase 
(FLP) and an autonomously replicating sequence (ARS) 9 also known as an 
origin of replication {ori) 7 located close to the end of one of the inverted 
repeats, (Futcher, 1988, Yeast, 4, 27; Murray & ah, op. tit* and Toh-e et 
al 7 1986, Basic Life Set 40 3 425), Despite their lack of discernible DNA 
sequence homology, their shared molecular architecture and the 
conservation of function of the three open reading frames have 
demonstrated a common ancestral link between the family members* 

Whilst any of the above naturally occurring 2pm-family plasmids can be 
used in the present invention, this invention is not limited to the use of 
15 naturally occurring 2^m-family plasmids. For the purposes of this 
Invention, a 2pm- family plasmid is as described below. 

A 2jj,m-family plasmid is a circular, double stranded, DNA plasmid. It is 
typically small, such as between 3,000 to 10,000 bp, preferably between 
20 4 ? 500 to 7000 bp, excluding recombinantly inserted sequences. 

A 2 jiim- family plasmid typically comprises at least three open reading 
frames C'ORFs'O that each encodes a protein that functions in the stable 
maintenance of the 2 -family plasmid as a multicopy plasmid. The 
25 proteins encoded by the three ORFs can be designated FLP 3 RBPL and 

REP2, Where a 2 [ma-family plasmid comprises not all three of the ORFs 
encoding FLP, REP1 and REP2 then ORFs encoding the missing pTOtein(s) 

should be supplied in trans, either on another plasmid or by chromosomal 

integration. 

S 
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A "FLP" protein is a protein capable of catalysing the site-specific 
recombination between inverted repeat sequences recognised by FLP, The 
inverted repeat sequences are tenhed FLP recombination target (FRT) sites 
5 and each is typically present as part: of a larger inverted repeat (see below). 
Preferred FLP proteins comprise the sequence of the FLP proteins encoded 
by one of plasmids pSRl 4 pSBl, pSB2, pSB3, pSB4 ? pSMl, pKDl, pPMl 
and the 2pm plasmid. for exainpjle as described in Volkert et a/„ op. ctt, 

Murray et al 7 op. cit., and Painting et al 9 op. cit* Variants and fragments of 

I 

10 these FLP proteins are also included in the present invention- "Fragments'" 
and "variants" are those which rjetain the abiEty of the native protein to 
catalyse the site-specific recombination between the same FRT sequences. 



Such variants and fragments will ujsually have at least 50%, 60% 9 70%, 80%, 
90%, 95%, 98%, 99%, or mora, homology with an FLP protein encoded by 
is one of plasmids pSRl, pSBl 3 pSE|2, P SB3 5 P SB4, pSMl, pJKDl, pPMl and 
the 2 Jim. plasmid. Different FLP proteins can have different FRT sequence 
specificities- A typical FRT site may comprise a core nucleotide sequence 



flanked by inverted repeat sequences. In "the 2\im plasmid, the FRT core 



sequence is 8 nucleotides in length and the flanking inverted repeat 
2D sequences ate 13 nucleotides in length (Volkert et al, op. cit). However the 
FRT site recognised by any given FLP protein may be different to the 2jim 
plasmid FRT site. 



KEPI and REP2 are proteins involved in the partitioning of plasmid copies 
25 during cell division, and may also have a role in the regulation of FLP 
expression. Considerable sequence divergence has been observed between 
REP1 proteins from different 2 pxm-femily plasmids, whereas no sequence 
alignment is possible between REP2 proteins derived from different 2pm- 
family plasmids. Preferred REP1 and REP2 proteins comprise the sequence 



*■ -.- l" I 
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of the KEPI and REF2 proteins encoded by one of plasmids pSRl, pSBl, 
P SB2, P SB3, pSB4, pSMl, pKDl, pPMl and the 2um plasmid, for example 
as described in Volfcert et al 9 op. ciL, Murray et al, op. cit., and Painting et 
al, op. cit. Variants and fragments of these KEPI and REP2 proteins are 
5 also included in the present invention, '^Fragments" and 'Variants* 5 of REP1 
and REP2 are those which, when encoded by the plasmid in place of the 
native OFF, do not substantially disrupt the stable multicopy maintenance of 
die plasmid within a suitable yeast population. Such variants and fragments 
of REP1 and RBP2 will usually have at least 5%, 10%, 20%, 30%, 40%, 
10 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or more, homology with a 
KEPI and REP2 protein, respectively, as encoded by one of plasmids pSRl, 
pSBl, pSB2, pSB3, pSB4, pSMl, pKDl, pPMl and the 2um plasmid. 



The RBP1 and REP2 proteins encoded by the ORFs on the plasmid rnnst be 
15 compatible. It is preferred that the REP1 and JREP2 proteins have the 
sequences of KEPI and REP2 proteins encoded by the same naturally 
occurring 2jjm-family plasmid, such as pSRl, pSBl, pSB2 7 pSB3, pSB4, 
pSMl, pKDl, pPMl and the 2jim plasmid, or variant or fragments thereof. 



20 A 2u.m-faroily plasmid typically comprises two inverted repeat sequences- 
The inverted repeats may be any size, so long as they each contain an FRT 
site (see above). The inverted repeats are typically highly homologous. 
They may share greater than 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 
9S%, 99%, 99.5% or more sequence identity. In a preferred embodiment 

25 they are identical. Typically the inverted repeats are each between 200 to 
1000 bp in length. Preferred inverted repeat sequences may each have a 
length of from 200 to 300 bp, 300 to 400 bp, 400 to 500 bp, 500 to 600 bp, 
600 to 700 bp, 700 to 800 bp, 800 to 900 bp, or 900 to 1000 bp. 
Particularly preferred inverted repeats arc those of the plasmids pSRl (959 

10 
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bp), pSBi (675 bp), pSB2 (477 bp), pSB3 (391 bp), pSMl (352 bp), pKDl 
(346 bp), the 2\xm plasmid (599 bp), pSB4 or pPMl. 

The sequences of the inverted repeats may be varied. However, the 
5 sequences of the ERT site in each inverted repeat should be compatible with 
the specificity of the FLP protein encoded by the plasmid, thereby to enable 
the encoded FLP protein to act to catalyse the site-specific recombination 
between the inverted repeat sequences of the plasmid. Recombination 
between inverted repeat sequences (and thus the ability of the FLP protein 

10 to recognise the FRT sites with the plasmid) can be determined by methods 
known in the art For example, a plasmid in a yeast cell under conditions 
that favour FLP expression can be assayed for changes in the restriction 
profile of the plasmid which would result from a change in the orientation 
.. of a region of the plasmid relative to another region of fhe plasmid. The 

15 detection of changes in restriction profile indicate that the FLP protein is 
able to recognise the FRT sites in the plasmid and therefore that the FRT 
site in each inverted repeat are compatible with fhe specificity of the FLP 
protein encoded by fhe plasmid. 

20 In a particularly preferred embodiment, fhe sequences of inverted repeats, 
including fhe FRT sites, are derived from the same 2Mm-farnily plasmid as 
the ORF encoding the FLP protein, such as pSRl, pSBl, pSB2, pSB3, 
pSB4, pSMl, pKDl, pPMl or fhe 2um plasmid. 

25 The inverted repeats are typically positioned with the 2pm-fami]y plasmid 
such that the two regions defined between the inverted repeats (e,g r such as 
defined as UL and US in the 2um plasmid) are of approximately similar 
size, excluding exogenously introduced sequences such as transgenes. For 
example, one of the two regions may have a length equivalent to at least 

11 
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40%, 50%, 60%, 70%, 80%, 90%, 95% or more, up to 100%, of the length ^ 
of the other region. 



A 2 fun-family plasmid typically comprises the ORF that encodes FLP and 
5 one inverted repeat (arbitrarily termed "IR1" to distinguish it from the other 
inverted repeat mentioned in the next paragraph) juxtaposed in such a 
manner that IR1 occurs at the distal end of the FLP ORF, without any 
intervening coding sequence, for example as seen in the 2/am plasmid. By 
"distal end" in this context we mean the end of the FLP ORF opposite to the 
10 end from which the promoter initiates its transcaption. In a preferred 
embodiment, the distal end of the FLP ORF overlaps with XRL 



A 2jim-family plasmid typically comprises the ORF that encodes REP2 and 
die other inverted repeat (arbiteafily termed "IR2" to distinguish it from IR1 
15 mentioned hi the previous paragraph) juxtaposed in such a manner that IR2 
occurs at the distal end of the REP2 ORF, without any intervening coding 
sequence, for example as seen in the 2um plasmid. By "distal end" in this 
context we mean the end of tire REP2 ORF opposite to the end from which 
the promoter initiates its transcription. 

20 

In one embodiment, the ORFs encoding REP2 and FLP may be present on 
the same region of the two regions defined between the inverted repeats of 
the 2f.im-family plasmid, which region may be the bigger or smaller of the 
regions (if there is any inequality in size between the two regions). 

25 ~ 

In one embodiment, the ORFs encoding REP2 and FLP may be transcribed 
from divergent promoters. 
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Typically, the regions defined between the inverted repeats (e.g, suck as 
defined as UL and US in the 2^in plasmid) of a 2pm-family plasmid may 
comprise not more than two endogenous genes that encode a protein that 
functions in the stable maintenance of the 2^m-family plasmid as a 
5 multicopy plasmid. Thus in a preferred embodiment, one region of the 
plasmid defined between the inverted repeats may comprise not more than 
the ORFs encoding FLP and REP2; FLP and REP1; or REP1 and REP2, as 
endogenous coding sequence. 

10 A 2pm.- family plasmid typically comprises an origin of replication (also 
known as an "autonomously replicating sequence - "ARS"), which is 
typically bidirectional. Any appropriate ARS sequence can be present 
Consensus sequences typical of yeast chromosomal origins of replication 
maybe appropriate (Broach et at, 1982, Cold Spring Harbor Symp. Quant. 

15 3ioL 7 47, 1165-1174; Williamson, Yeast, 1985, 1, 1-14)- Preferred ARSs 
include Ihose isolated from pSRl, pSBl, pSB2, pSB3 3 pSB4, pSMl ? pKDl, 
pPMl and the 2]Uim plasmid. 

Thus, a preferred 2pm-family plasmid may comprise ORFs encoding FU% 
20 RBP1 and REP2, two inverted repeat sequences each inverted repeat 
comprising an FRT site compatible with the encoded FLP protein, and an 
ARS sequence. Preferably the FRT sites arc derived from the same 2pm- 
family plasmid as the sequence of the encoded FLP protein. More 
preferably the sequences of the encoded REP1 and REP2 proteins are 
25 derived from the same 2pan-family plasmid as each other. TBven more 
preferably, the FRT sites are derived from the same 2^m-family plasmid as 
the sequence of the encoded FLP, REP1 and REP2 proteins. Yet more 
preferably, the sequences of the ORFs encoding FLP, REP1 and K.EP2, and 

the sequence of the inverted repeats (including the FRT sites) are derived 

13 
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from the same 2 j^m- family plasmid. Furthermore, the ARS site may be 
derived from the same 2p.m~fami.ly plasmid as one or more of the ORFs of 
FLP ? EJEP1 and REP2, and the sequence of the inverted repeats (including 
the FRT sites). 

5 

The term "derived from" includes sequences having an identical sequence 
to the sequence from which they are derived. However, variants and 
fragments thereof, as defined above, are also included. For example, an 
FLP gene having a sequence derived from the FLP gene of the 2 Jim 

10 plasmid may have a modified promoter or other regulatory sequence 
compared to that of the naturally occurring gene. Additionally or 
alternatively, an FLP gene having a sequence derived from the FLP gene of 
the 2pm plasmid may have a modified nucleotide sequence in the open 
reading frame which may- encode the same protein as the naturally 

15 occurring gene, or may encode a modified FLP protein. The same 
considerations apply to other sequences on a 2jjjn-family plasmid having a 
sequence derived from a particular source. 



Optionally, a 2(xm-family plasmid may comprise a region derived from the 
20 STB region (also known as REP3) of the 2>im plasmid, as defined in 

Volkert et al, op. cit The STB region in a 2pm~family plasmid of the 
invention may comprise two or more tandem repeat sequences, such as 
three, four, five or more. Alternatively, no tandem repeat sequences may be 
present. The tandem repeats may be any size, such as 10 3 20, 30-, 40, 50, 60 
25 70 ? SO, 90, 1 00 bp or more in length. The tandem repeats in the STB region 
of the 2fim plasmid are 62 bp in length. It Is not essential for the sequences 
of the tandem repeats to be identical. Slight sequence variation can be 
tolerated. It may be preferable to select £n STB region from the same 

14 
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plasmid as either or both of ihe REP1 and REP2 ORFs. The STB region is 
thought to "be a c&-aetmg element and preferably is not transcribed. 

i 

Optionally, a 2pm-family plasmid may comprise an additional GRF that 
5 encodes a protein that functions in the stable maintenance of the 2 pm- 
famxly plasmid as a multicopy plasmid. The additional protein can be 
designated RAF or D. ORFs encoding the RAF or D gene can be seen cm, 
for example, the 2pm plasmid and pSML Thus a RAF or D ORF can 
comprise a sequence suitable to encode the protein product of the RAF or D 

10 gene ORFs encoded by the 2pm plasmid or pSMl, or variants and 
fragments thereof. Thus variants and fragments of the protein products of 
the RAF or D genes of the 2\xnx plasmid or pSMl are also included in the 
present invention. "Fragments" and "variants" of the protein products of the 
RAF or D genes of the 2pm plasmid or pSMl are those which, when 

IS encoded by the 2pm plasmid or pSMl in place of the native ORF, do not 
disrupt the stable multicopy maintenance of the plasmid within a suitable 
yeast population. Such variants and fragments will usually have at least 5%, 
10% 5 20%, 30%, 40% ? 50%, 60%, 70% a 80%, 90%, 95%, 98%, 99%, or 
more, homology with the protein product of the RAF or E> gene ORFs 

20 encoded by the 2|um plasmid or pSMl, 

A naturally occurring 2pm-family plasmid may be preferred, A naturally 
occurring 2p.m-family plasmid is any plasmid having the features defined 
above, which plasmid is found to naturally exist in yeast, i.e. has not been 
25 recombinantly modified to include heterologous sequence- Preferably the 
naturally occurring 2 (Um- family plasmid is selected from pSRl (Accession 
No. X02398), pSB3 (Accession No. X0260S) or pSB4 as obtained from 
Zygosaccharomyces rouxiU pSBl or pSB2 (Accession No. NCJ302055 or 

15 
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M18274) both as obtained from Zygosaccharomyces baiUi, pSMl ^ 
(Accession No. NCj)02054) as obtained from Zygosacckaromyaes ^ 
fermentati, pKDl (Accession No, X03961) as obtained from 
Kluyveromyces drosophilarum, pPMl from Pichia membranaefaciens or 
most preferably, the 2pm plasmid (Accession No. NC_001398 or J01347) 
as obtained from Saccharomyces cerevmae. Accession numbers in this 
paragraph refer to NCBI deposits. 

The 2um plasmid (Figure 1) is a 6,31S-bp double-stranded DNA plasmid, 
endogenous in most Saccharomyces cerevisiae strains at 60-100 copies per 
haploid genome. The 2pm plasmid comprises a small unique (US) region 
and a large unique (UL) region, separated by two 599-bp inverted repeat 
sequences. Site-specific recombination of the inverted repeat sequences 
results in inter-conversion between the A-form and B-form of the plasmid 
/« vivo (VoJkert & Broach, 198fi, Cell. 46, 541). The two forms of 2um differ 
only in the relative orientation of their unique regions. 

While DNA sequencing of a cloned 2p.ni plasmid (also known as Scpl) 
from Saccharomyces cerevisiae gave a size of 6,318-bp (Hartley and 
Donelson, 1980, Nature, 286, 860), oilier slightly smaller variants of 2pm, 
Scp2 and Scp3, are known to exist as a result of small deletions of 125-bp 
and 220-bp, respectively, in a region known as STB (Cameron et aL, 1977, 
Nucl. Acids Res., 4, 1429: Kikuchi, 19S3, Cell, 35, 487 and Livingston & 
Hahne, 1979, Proc. Natl. Acad. ScL USA, 76, 3727). In one study about 
80% of natural Saccharomyces strains from around the world contained 
DNA homologous to 2um (by Southern blot analysis) (Hollenberg, 1982, 
Current Topics in Microbiology and Zmmunobioiogy, 9€, ll£>). 
Furthermore, variation (genetic polymorphism) occurs within the natural 
population of 2um plasmids found in S. cerevisiae and S. carlsbergensis. 
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with title NCBI sequence (accession number NC 001398) being one 
example. 

The 2]xm plasmid has a nuclear localisation and displays a high level of 
5 mitotic stability (Mead et al* 1986, Molecular General Genetics, 205, 
417). The inherent stability of the 2(mi plasmid results from a plasmid- 
en coded copy number amplification and partitioning mechanism, which can 
be compromised during the development of chimeric vectors (Futcher & 
Cox, 1984,*/. Bacterial* 157, 283; Baohmair $l Ruis, 1984, Mon&tshefie fur 
10 Chemie, 115, 1229). A yeast strain, which contains a 2fim plasmid is 
known as [cir + ], while a yeast strain which does not contain a 2\im plasmid 
is known as [cir 0 ]. 

The US-region of the 2pm plasmid contains the REP2 and FLP genes, and 
15 the UL-region contains the REP I and D (also known as RAF) genes, the 
STE-locus and the origin of replication (Broach & Hicks, 1980, Cell* 21, 
501; Sutton & Broach, 1985, Mol Cell Biol, 5, 2770). The Flp 
recombinase binds to FRT-sites (Flp Recognition target) within the 
inverted repeats to mediate site-specific recombination, which is essential 
20 for natural plasmid amplification and control of plasmid copy number in 
vivo (Senecoff et al 9 1985, Proa. Nail Acad. Set U.S. A., 82, 7270; Jayararo, 
1985, Proc. Natl Acad. ScL CJ.S.A* 82, 5875). The copy number of 2jmi- 
family plasmids can be significantly affected by changes hi Flp 
recombinase activity (Sleep et af> 200 1 3 Yea$i y IS, 403; Rose & Broach, 
25 1990, Methods Enzymol* 185, 234). The Repl and Rep2 proteins mediate 
plasnaid segregation, although their mode of action is unclear (Sengupta et 
al 9 2001, J. Bacterial* 183, 2306). They also repress transcription of the 
FLP gene (Reynolds et al 1987, Mol Ceil. Bid, 7, 3*566). 

17 
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The FLP and REP2 genes of the 2ju.ni plasmid are transcribed from 
divergent promoters, with apparently no intervening sequence defined 
between them. The FLP and REP2 transcripts both terminate at the same 
sequence motifs within the inverted repeat sequences, at 24-bp and 178-bp 
respectively after their translation termination oodons (Sutton & Broach, 
1985, Mol Cell Biol, 5, 2770), 



In the case of FLP, the C-texminal coding sequence also lies within the 
inverted repeat sequence. Furthermore;, the two inverted repeat sequences 

10 are highly conserved over 599-bp, a feature considered advantageous to 
efficient plasmid replication and amplification in vivo, although only the 
FRT-sites (less than 65-bp) are essential for site-specific recombination in 
vitro (Senecoff et ol, 1925, Proc. Natl Acad. Set ZL8*A. 9 82, 7270; 
Jayararn, 1985, Proc. Nad. Acad. ScL S2, 5S75; Meyer-Leon et al 7 

15 1984, Cold Spring Harbor Symposia On Quantitative Biology, 49, 797). 
The key catalytic residues of Flp are arginine-308 and tyrosme-343 (which 
is essential) with strand-cutting facilitated by histidine-309 and histidme 
345 (Prasad et al* 19S7, Proc. Natl Acad. Sci. U.SA^ 84, 21S9; Chen et al ? 
1992, Cell, 69, 647; Grainge etal 2001, J. Mol BioL, 314, 717). 

20 

Two functional domains are described in Rep2. Residues 15-58 form a 
Rep 1 -binding domain, and residues 59-296 contain a self-association and 
STB-binding region (Seogupta et al, 2001, J. BacterioL, 1,83, 2306). 

25 Chimeric or large deletion mutant derivatives of 2|um which Kick many of 
. the essential functional regions of the 2fjim plasmid but retain the functional 
cis element ARS and STB, cannot effectively partition between mother and 
daughter cells at cell division. Such plasmids can do so if these functions 
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are supplied in trans, by for instance the provision of a functional 2pm 
plasmid within the host, such as a [cir + ] host 

Genes of interest have previously been inserted into the UL-region of the 
5 2(JLtn plasmid- For example, see plasmid pSAC3Ul in EP 0 286 424 and the 
plasmid shown in Figure 2 9 which includes a lactamase gene (for 
ampicillin resistance), a LEU2 selectable marker and an oligonucleotide 
linker, the latter two of which are inserted into a unique 5«aBI-site within 
the UL-region of the 2^im-like disintegration vector 3 pSAC3 (see EP 0 286 
10 424) T The E, coli DNA between the Jftal-sites that contains the ampicillin 
resistance gene is lost from the plasmid shown in Figure 2 after 
transformation into yeast This is described in Chinery & Hinchliffe, 1989, 
Curr. Genet, 16, 21 and EP 0 286 424, where these types of vectors are 
designated "disintegration vectors". Further polynucleotide insertions can 
15 be made in a JVctfl-site within a linker (Sleep et aU I99l 7 Biotechnology (N 
V, % 183). 

Alternative insertion sites in 2^m plasmid arc known m the art, including 
those described in Rose & Broach (1990, Methods gnzymol, 185, 234-279), 

20 such as plasmids pCV19, pCV20, CV nfi:D , which utilise an insertion at EcoSl 
in FLP, plasmids pCV21, pGT41 and P YE which utilise EcoBI m D as the 
insertion site, plasmid pHKB52 which utilises Pstl in D as the insertion site, 
plasmid p JDB248 which utilises an insertion at Pstl in D and EcoRl in D, 
plasmid pJDB219 in which Pstl in D and EcoM in FLP are used as 

25 insertion sites, plasmid GIB, plasmid pABIS which utilises an insertion at 

Clal in FLP, plasmids pGT39 and pA3 ? plasmids pYTll, pYT14 and 

pYTll-LEU which use Pstl m D as the insertion site, and plasmid PTY39 

which uses EcdRI in FLP as the insertion site. Other 2\im plasmids include 

pSAC3 ? pSAC3Ql, pSAC3U2, pSAC300, pSAC310 ? pSAC3Cl, 

19 
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pSACSPLl, pSAC3SL4, and pSAC3SCl are described in EP 0 286 424 and 
Chinery & Hinchliffe (1989, Curr. Gen&U 16* 21-25) which also described 
PsfL, Eagl or SndBI as appropriate 2\im insertion sites. Further 2pm 
plasmids include pAYE255, pAYE316 3 pAYE443, pAYE522 (Kerry- 
5 Williams et al f 1998, Yeast, 14, 161169), P DB2244 (WO 00/44772), and 
PAYE329 (Sleep et al, 2001, Yeast, 1S ? 403-421). 



In one preferred embodiment, one ox more genes are inserted into a 2nm- 
family plasmid within an untranscribed region around the ARS sequence, 

10 Fox example, in the 2pm plasmid obtained from $. c&revisia^ the 
untranscribed region around the ARS sequence extends from end of the D 
gene to the beginning of ARS sequence. Insertion into SndBI (near the 
origin of replication sequence ARS) is described in Chinery & Hinchliffe, 
19B9* Curr. Genet., 16, 21-25. !Mie skilled person will appreciate that gene 

15 insertions can also be made in the nntransexibed region at neighbouring 
positions to the SnaRI site described in Chinery & Hinchliffe. 

In another preferred embodiment, REP2 and FLP genes in a 2jxm-family 
plasmid each have an inverted repeat adjacent to them, and one or more 

20 genes are inserted into a 2^m-family plasmid within the region between the 
first base after the last functional codon of either the REP2 gene or the FLP 
gene and the last base before the FRT site in the inverted repeat adjacent to 
said gene, The last functional codon of either a REP2 gene or a FLP gene is 
the codon in the open reading frame of the gene that is furthest downstream 

25 from the promoter of the gene whose replacement by a stop codon will lead 
to an unacceptable loss of multicopy stability qf the plasmid, as defined 
herein. Thus, disruption of the REP2 or FLP genes at any point 
downstream of the last functional codon in either gene, by insertion of a 
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polynucleptide sequence insertion, deletion or substitution will not lead to 
an Unacceptable loss of multicopy stability of the plasmid. 



For example, the REP2 gene of the 2\im plasmid can be disrupted after 
5 codon 59 and that the FLP gene of the 2jjlth plasmid can be disrupted after 
codon 344, each without a loss of multicopy stability of the plasmitL The 
last functional codon in equivalent genes in other 2nm-family plasmids can 
be determined routinely by making mutants of the plasmids in either the 
FLP or REP2 genes and following the tests set out herein to determine 
lo whether the plasmid retains multicopy stability. 

One can determined whether a plasmid retains multicopy stability using 
test such as defined in Chinery & Hinchliffe (19S9, Cum Genet., 16, 21- 
25). For yeast that do not grow in the^rion-selective media (YPD, also 

15 designated YEPD) defined in Chinery & Hinchliffe (1989, Curr. GeneL, 16, 
21-25) other appropriate non-selective media might be used. Plasmid 
stability may be defined as the percentage cells remaining prototrophic for 
the selectable marker after a defined number of generations, the number of 
generations will preferably be sufficient to show a difference between a 

20 control plasmid, such as pSAC35 or pSAC310, or to shown comparable 
stability to such a control plasmid. The number of generations may be 1 3 2 a 
3, 4, 5, 6 9 7, 8, 9 ? 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 
45, 50, 60, 70, 80, 90, 100 or more. Higher numbers are preferred. The 
acceptable plasmid stability might be 1%, 2%, 3%, 4%, 5%, 10%, 15%, 

25 20%, 25%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 
97%, 9S%, 99%, 99.9% or substantially 100%- Higher percentages are 
preferred. The skilled person will appreciate that, even though a plasmid 
may have a stability less than 100% when grown on non-selective media, 
that plasmid can still be of use when cultured in selective media. For 

21 
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example plasmid pDB2711 as described in the examples is only 10% stable 
when the stability is determined accordingly to test of Example 1, but 
provides a 15-fold increase in recombinant transferrin productivity in shake 
flask culture under selective growth conditions. 

5 

Thus one or more gene insertions may occur between the first base after the 
last functional codon of the REP2 gene and the last base before me FRT site 
in an inverted repeat adjacent to said gene, more preferably between the 
first base of the inverted repeat and the last base before the FRT site, even 
10 more preferably at a position after me translation termination codon of the 
REP2 gene and before the last base before the FRT site. 

Additionally or alternatively one or more gene insertions may occur 
between the first base after the last functional codon,of the FLP gene and 
15 the last base before the FRT site in an inverted repeat adjacent to said gene, 
preferably between the first base of the inverted repeat and the last base 
before the FRT site, more preferably between the first base after the end of 
the FLP coding sequence and the last base before the FRT site, such as at 
the first base after the end of the FLP coding sequence. 



20 



25 



In one preferred embodiment, where the 2um-family plasmid is based on 
the 2 urn plasmid of & cerevisiae, it is a disintegration vector as known in 
the art (for example, see EP 286 424, the contents of which are incorporated 
herein by reference). A disintegration vector may be a 2um plasmid vector 
comprising a DNA sequence which is intended to be lost by recombination 
three 2pm FRT sites, of which one pair of sites is in direct orientation and 
the other two pairs are in indirect orientation, and a DNA sequence of 
interest (such as an K eoli origin of replication and bacterial selectable 



22 



23/12 '03 13; 09 FAX 01159 552201 



ERIC POTTER CLARK SON 



121028 



marker), the said sequence to be lost being located between the said sites 
which are in direct orientation- 

■ 

Thus,, the sequence to be lost may comprise a selectable marker DNA 
5 sequence. 

A preferred disintegration vector comprises a complete 2 pm plasmid 
additionally caiiying (i) a bacterial plasmid DNA sequence necessary for 
propagation of the vector in a bacterial host; (ii) an extra 2|Lim FRT site; and 

10 a selectable marker DNA sequence for yeast transformation; the said 
bacterial plasmid DNA sequence being present and the extra FRT site being 
created at a restriction site, such as J£bal„ in one of the two inverted repeat 
sequences of the 2um plasmid, the said extra FRT site being in direct 
orientation in relation to the endogenous FRT site of the said one repeat 

15 sequence, and the bacterial plasmid DNA sequence being sandwiched 
between the extra FRT site and the endogenous FRT site of the said one 
repeat sequence. In a preferred disintegration vector, all bacterial plasmid 
DNA sequences are sandwiched as said. A particularly preferred 2pm 
plasmid vector has substantially the configuration of pSAC3 as shown in EP 

20 286 424. 

The term "disintegration vector" as used herein also includes piasmids as 
defined in US 6,451,559, the contents of which are incorporated herein by 
reference. Thus a disintegration vector may be a 2pm- vector that, other 
25 than DNA sequence encoding non-yeast polypeptides, contains no bacterial 
(particularly K coli) origin of replication, or more preferably no bacterial 
(particularly E. coli) sequence and preferably all DNA in said vector, other 
than DNA sequence encoding n on -yeast polypeptides, is yeast-derived 
DNA. 

23 
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The term "chaperone" as used herein refers to a protein that binds to and ^ 
stabilises an otherwise unstable conformer of another protein, and by 
controlled binding and release, facilitates its correct fate in vivo, be it 
folding, oligomers assembly, transport to a particular subcellular 
compartment, or disposal by degradation. Accordingly a chaperone is also 
a protein that is involved in protein folding, or which has chaperone activity 
or is involved in the unfolded protein response. Chaperone proteins of this 
type are known in the art, for example in the Stanford Genome Database 
(SGD), htlp:://db.yeastgenome.org. Preferred chaperones are eukaryotic 
chaperones, especially preferred chaperones are yeast chaperones, including 
AHA1, CCT2, CCT3, CCT4, CCT5, CCT6, CCT7, CCT8, CAE/, CPR3, 
CPR6, EROl, EUG1, FMOl, HCffl, HSP1Q, HSP12, HSP104 t HSP26, 
HSP30, HSP42, HSP60, HSP7S, HSP82, JEMJ, MDJ1, MDJ2, MPIZ1, 
MPD2, PDll, PFD1, ABC1, APJ1, ATP11, ATP12, BTTl, CDC37, CPR7, 
HSC82, KAR2, LHS1, MGE1, MRSI1, NOBI, ECM1Q, SSA1, SSA2, SSA3, 
SSA4, SSCI, SSE2, SEL1, SLS1, ORM1, ORM2, PERI, PTC2, PSE1, UBI4 
and HAC1 or a truncated intronless HAC1 (Valkonen ei al. 2003, Applied 
Environ, Micro., 69, 2065) 

A preferred chaperone is protein disulphide isomerase (PDI) or a fragment 
or variant thereof having an equivalent ability to catalyse the formation of 
disulphide bonds within the lumen of the endoplasmic reticulum (ER). By 
"PDI" we include any protein having the ability to reactivate the 
ribonucleic activity against RNA of scrambled ribonucl ease .as described 
in EP 0 746 611 and Hillson ei al, 1984, Methods Enzymol. , J 07, 281-292. 

PDI is an enzyme which typically catalyzes thiohdisulphide interchange 
reactions, and is a major resident protein component of the ER lumen in 

24 
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secretory cells. A body of evidence suggests that it plays a role in secretory 
protein biosynthesis (Freedman, 1984, Trends Biochem, Sci., 9, 438-41) and 
this is supported by direct cross-linking studies in situ (Roth and Pierce, 
1987, Biochemistry, 26, 4179-82). The finding that microsomal membranes 

5 deficient in PDI show a specific defect in cottanslational protein disulphide 
(Bulleid and Freedman, 1988, Nature, 335, 649-51) implies that the enzyme 
functions as a catalyst of native disulphide bond formation during the 
biosynthesis of secretory and cell surface proteins. This role is consistent 
with what is known of die enzyme's catalytic properties in vitro; it catalyzes 

10 thiol: disulphide interchange reactions leading to net protein disulphide 
formation, breakage or isomerization, and can typically catalyze protein 
folding and the formation of native disulphide bonds in a wide variety of 
reduced, unfolded protein substrates (Freedman et at, 1989, Biochem. Soc. 
Sympl, 55, 167-192). The DNA and amino acid sequence of the enzyme is 

t5 known for several species (Scherens et al, 1991, Yeast, 7, 185-193; 
Farquhar et al, 1991, Gene, 108, Sl-89; EP074661; EP0293793; 
EP0509841) and there is increasing information on the mechanism of action 
of the enzyme purified to homogeneity from mammalian liver (Creighton et 
al, 1980, J. Mol Biol, 142, 43-62; Freedman et al, 19S8, Biochem. Soc. 

20 Trans., 16, 96-9; Gilbert, 1989, Biochemistry, 28, 7298-7305; Lundstrom 
and Holmgren, 1990, J. Biol. Ckem., 265, 9114-9120; Hawkins and 
Freedman, 1990, Biochem, J., 275, 335-339). Of the many protein factors 
currently implicated as mediators of protein folding, assembly and 
translocation in the cell (Rothman, 1989, Cell, 59, 591-601), PDI lias a 

25 well-defined catalytic activity. 4 



PDI is readily isolated from mammalian tissues and the homogeneous 
enzyme is a homodimer (2x57 kD) with characteristically acidic pi (4.0- 
4.5) (Hillson et al, 1984, op. cit.). The enzyme has also been purified from 

25 
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wheat and from the alga Cklamydomonas reinhardii (Kaska et al, 1990, ^ 
Biochem. J., 26S, 63-68), rat (Edman et al, 1985, Nature, 317, 267-270),' ^ 
bovine (Yamaucbi et al, 1987, Biochem. Biophys. Res. Comm., 146, 1485- 
1492), human (Pihlajamend et at, 19S7, EMBO J,, 6, 643-9), yeast 
(Scherens et al, suprar, Farquhar et al, op. cit.) and chick (Parkkonen et al, 
1988, Biochem. J., 256, 1005-1011). The proteins from these vertebrate 
species show a high degree of sequence conservation throughout and all 
show several overall features first noted in the rat PDI sequence (Edman et 
at., 1985, op. c£t.). 

Prefeired PDI sequences include those from humans and tihose from yeast 
species, such as S. cerevisiae. 

A yeast protein disulphide isomerase precursor, EDM. can be found as 
Genbank accession no. CAA42373 or BAA00723. It has tire following 
sequence of 522 amino acids: 

1 uikf sagavls wsslllassv fac^avape ^wklatd S £ My ±q S hd l^laeffapw 
SI eghaknraape yvkaaetlva knifclaqid c te aqdlc«neh ni P0 £ ps lki £k ils dvxm^i 
121 dyagprtaea iv<,£ m ikq S q p^wadXp ayWtfv* pv iv<Isg kAa adfuatfyem 
1B1 ankhfndydf vsaenadddf klsiylpeam depwyagkk ad±adad V £e k^lqrealpy 

241 fg^idgsvja gyv^^g ylfYridseel eeykplftel akknrglnmf V6idarkfgr 
3 01 faagalrvr.keot fpifaihdmt edlkyglpql seeafdelsd kivieskaia slvkdfikgd 
361 aepi^ksgei fenqdsavfg Ivgknhdaiv ndpkkdvlvl yyapwugbok r la P t yiIe l a 
431 dCyanatsdv liakldhten dvrgwiegy ptivlypggk kseswyqgs orsldslfdfi 
*B1 kanghfdvdg kalyeeaqek aaeeadadae ladeedaihd el 



An alternative yeast protein disulphide isomerase sequence can be found as 
Genbank accession no. CAA38402. It has the following sequence of 530 
30 amino acids 
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1 mkf sag-avis -wsslllassv f aqqfccLvape dsawklatd s3£rj.eyi<£Slid lvlaeff apw 
6T1 cghctaunape yvkaaetlve knitlaqidc tenqdlcmeb. nipgfpslki fknrdvnnsi 
121 dyegprtaea Ivgfmikqscc pavarvadlp aylazietfvt pvivqsgKid adfnatfysm 
181 ankhfndydf vsaenadddf klsiylpgam depwynglA adiadadvfe fcwlgyealpy 
241 fgeid^vfa qyvesglplg ylfyndeeel eeykplftal ^kknxrglnuaf Vsida^kfgr- 
301 hagalmrkeq fplfaihdmt edlkyglpql seeaf&elsd kivleskaie slvkdflkgd 
361 acpivksqei feii^dssvfq lvgknludeiv ndpkkdvlvl yyapwcg-lick rlapcyqela 
421 dtyanatsdv liakldhten dvrgwiegy ptivlypggk kseswyqg§ rsld3lfdf i 
4G1 kenglifdvdg kalyeeaqek aaeeaeadae aeadadaela deedaiJi^el 



Variants and fragments of the above PDI sequences, and variants of other 
naturally occurring PDI sequences are also included in the present invention. 
A 'Variant", in the context of PDL, refers to a protein wherein at one or more 
positions there have been amino acid insertions., deletions, or substitutions, 

15 either conservative or non-conservative, provided that such changes result in a 
protein whose basic properties, for example enzymatic activity (type of and 
specific activity), thermostability, activity in a certain pH-range (pH-stability) 
have not significantly been changed. "Significantly" in this context means that 
one skilled in the art would say that the properties of the variant may still be 

20 different but would not be unobvious over the ones of the original protein. 

By "conservative substitutions" is intended combinations such as VaL He, 
Leu, Ala, Met; Asp, Glu; Asn, Gin; Ser, Tbr, Gly, Ala; Lys, Arg, His; and 
Phc, Tyr, Trp. Preferred conservative substitutions include Gly, Ala; Val, lie, 
25 Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 

A "variant" typically has at least 25%, at least 50%, at least 60% or at least 
70%, preferably at least S0%, more preferably at least 90%, even more 
preferably at least 95%, yet more preferably at least 99%, most preferably at 
30 least 99.5% sequence identity to the polypeptide from which it is derived. 
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The percent sequence identity between two polypeptides may be determined 
using suitable computer programs, as discussed below. Such variants may 
be natural or made using the methods of protein engineering and site^directed 
mutagenesis as are well known in the art 

A 'fragment", in the context of PDI, refers to a protein wherein at one or more 
positions there have been deletions. Thus the fragment may comprise at most 
5, 10, 20, 30, 40 or 50% of the complete sequence of the full mature PDI 
protein. Typically a fragment comprises up to 60%, more typically up to 
70%, preferably up to 80%, more preferably up to 90%, even more preferably 
up to 95%, yet more preferably up to 99% of the complete sequence of the full 
PDI protein. Particularly preferred fragments of PDI protein comprise one or 
more whole domains of the desired protein. 

is A gene encoding a protein comprising the sequence of a chaperone may be 
formed in a like manner to that discussed below for genes encoding 
heterologous proteins, with particular emphasis on combinations of ORFs 
and regulatory regions. 



20 



20 



25 



ine term protein as used herein includes all natural and non-natural 

proteins, polypeptides and peptides. A heterologous protein" is a protein 

that is not naturally encoded by a 2um-family plasmid. Preferably, 

therefore, the heterologous protein is not a FLP, REPl, REP2, or a RAF/D 

protein as encoded by any one of pSRl, P SB3 or P SB4 as obtained from Z 

rouxii, pSBl or pSB2 both as obtained from K baitti, pSMl^as obtained 

from Z. fermentati, pKDl as obtained from K. drosophilarum, pPMl as 

obtained from P. membranaefaeiens or the 2pm plasmid as obtained from 
& cerevisiae. 
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A gene encoding a heterologous protein comprises polynucleotide sequence 
encoding the heterologous protein (typically according to standard codon 
usage for any given organism), designated the open reading frame ("ORF"). 
The gene may additionally comprise some polynucleotide sequence that 
5 does not encode an open reading frame (termed e ^ion-coding region"). 

Non-coding region in the gene may contain one or more regulatory 
sequences, operatively linked to the ORF, which allow for fhe transcription 
' of the open reading frame and/or translation of the resultant transcript 

10 

The term "regulatory sequence" refers to a sequence that modulates (i.e., 
promotes or reduces) the expression (he., the transcription and/or 
translation) of an ORF to which it is operably linked. Regulatory regions 
typically include promoters, terminators, ribosome binding sites and the 
15 like. The skilled person will appreciate that the choice of regulatory region 
will depend upon the intended expression system. For example, promoters 
may he constitutive or inducible and may be cell- or tissue-type specific or 
non-specific. 

20 Suitable regulatory regions, may be 5bp, 10bp, 15bp, 20bp, 25bp, 30bp, 
35bp, 40bp, 45bp, 50bp, 60bp, 70bp, SObp, 90bp, lOObp, 120bp, 140bp, 
160bp, 180bp, 200bp, 220bp, 240bp, 2(5Qbp, 280bp, 300bp, 350bp, 400bp, 
450bp, 500bp, 550bp, 600bp, 6S0bp, 700bp, 750bp, SOObp, 850bp, 900bp, 
950bp, lOOObp, llOObp, 1200bp, 1300bp, WOQpp, 1500bp or greater, in 

25 length. * 

Those skilled in the art will recognise that the gene encoding the chaperons, 
for example PDI, may additionally comprise non-coding regions and/or 
regulatory regions. Such non-coding regions and regulatory regions are not 
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restricted to the native non-coding regions and/or regulatory regions 
normally associated with die ohaperone ORR 

Where the expression system is yeast, such as Saccharomyces cerevisiae, 
5 suitable promoters for S. cerevisiae include those associated with the PQK1 
gene, GAL1 or GAL JO genes, TEF1, TEF2> PYK1 3 PMA1, CYCf, PROS, 
TRPI, ADW, ADII2, the genes for glyeeraldehyde-3-phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokraase, 
triose phosphate isomerase, phosphoglucose isomerase, glucokinase, <*- 
10 mating factor pheromone, a-mating factor pheromone, the PRB1 promoter, 
Ihe PRA1 promoter, the GPD1 promoter, and hybrid promoters involving 
hybrids of parts of 5' regulatory regions with parts of 5' regulatory regions 
of other promoters or with upstream activation sites (e.g. the promoter of 
EP-A-25S 067). 

15 

Suitable transcription termination signals are well known in the art, Where 
the host cell is eukaryotic, the transcription termination signal is preferably 
derived from the 3' flanking sequence of a eukaryotic gene, which contains 
proper signals for transcription termination and poly&denylatian. Suitable 3' 
20 flanking sequences may, for example, be those of the gene naturally linked 
to the expression control sequence used, i.e. may correspond to the 
promoter. Alternatively, they may be different. In that case, and where the 
host is a yeast, preferably S. cerevisiae, then the termination signal of the S. 
cerevisiae ADH1, ADH2, CYC1, or PGK1 genes are preferred. 



25 



In one embodiment, the favoured regulatory sequences in yeast, such as 
Saccharomyces cerevisiae, include: a yeast promoter (e.g. the 
Saccharomyces cerevisiae PRB1 promoter), as taught in EP 43 1 880; and a 
transcription terminator, preferably the terminator from Saccharomyces 
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ADHI, as taught in EP 60 057. Preferably, the vector incorporates at least 
two translation stop codons. 

It may be beneficial for the non-coding region to incorporate more than one 
5 DNA sequence encoding a translations! stop codon, such as UAA, UAG or 
UGA, in order to minimise traiislational read-throngh and thus avoid the 
production of elongated, non-natural fusion proteins. The translation stop 
codon UAA is preferred. 

10 The term "operably linked" includes within its meaning that a regulatory 
sequence is positioned within any non-coding region in a gene such that it 
forms a relationship with an ORF that permits the regulatory region to exert 
an effect on the ORF in its intended manner. Thus a regulatory region 
"operably linked" to an ORF is positioned in such a way that the regulatory 

is region is able to influence transcription and/or translation of the ORF in the 
intended maimer, under conditions compatible with the regulatory 
sequence. 

In one preferred embodiment, the heterologous protein is secreted. In that 
20 case, a sequence encoding a secretion leader sequence which, for example, 
comprises most of the natural HSA secretion leader, pins a small portion of 
the S. cerevisiae a-mating factor secretion leader as taugbtin WO 90/01063 
may be included in the open reading frame. 

25 Alternatively, the heterologous protein may be intracellular. 

In another preferred embodiment, the heterologous protein comprises the 
sequence of a cukaryotic protein, or a fragment or variant thereof. Suitable 
eukaryotes include fungi, plants and animals. In one preferred embodiment 
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20 



25 



the heterologous protein is a fungal protein, such as a yeast protein In ^ 
another preferred embodiment the heterologous protein is an animal protein. " 
Exemplary animals include vertebrates and invertebrates. Exemplary 
vertebrates include mammals, such as humans, and non-human mammals. 

Thus the heterologous protein may comprise the sequence of a yeast 
protein. It may, for example, comprise the sequence of a yeast protein from 
the same host from which the 2um-family plasmid is derived. Those skilled 
in Hie art will recognise that a method, use or plasmid of the first, second or 
fluid aspects of Hie invention may comprise DNA sequences encoding more 
than one heterologous protein, more than one chaperone, or more than one 
heterologous protein and more than one chaperone. 

In another preferred embodiment, the heterologous protein may comprise 
the sequence of albumin, a monoclonal antibody, an etoposide, a serum 
protein (such as a blood clotting factor), antistasin, a tick anticoagulant 
peptide, transferrin, lactoferrin, endostatin, angiostatin, collagens, 
immunoglobulins, Fab' fragments, F(ab')2, SC Ab, SC Fv, interferons, 
interleukins, IL10, IL11, IL2, interferon a species and sub-species, 
interferon p species and sub-species, interferon y species and sub-species, 
leptin, CNTF, CNTFa^s, IL1 -receptor antagonist, erythropoietin (EPO) and 
EPO mimics, thrombopoietin (TPO) and TPO mimics, prosaptide, 
cyanovirin-N, 5-helix, T20 peptide, T1249 peptide, HIV gp41, HIV gpl20, 
urokinase, prourokinase, tPA, hirudin, platelet derived growth factor, 
parathyroid hormone, proinsulin, insulin, glucagon, glucagon-like peptides, 
insulin-like growth factor, calcitonin, growth hormone, transforming growth 
factor p, tumour necrosis factor, G-CSF, GM-CSF, M-CSF, FGF, 
coagulation factors in both pre and active forms, including but not limited to 
plasminogen, fibrinogen, thrombin, prothrombin pro-thrombin, von 
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Willebrand's factor, ^antitrypsin, plasminogen activators, Factor VII, 
Factor VIE, Factor IX, Factor X and Factor XHI, nerve growth factor, 
LAO, platelet-derived endothelial cell growth factor (PD-ECGF), glucose 
oxidase, serum cholinesteraae, aprotinin, amyloid precursor protein, intex- 
5 alpha trypsin inhibitor, antithromhin in, apo-lipoprotein species, Protein C, 
Protein S, or a variant or fragment of any of the above. 

A 'Variant", in the context of the above-listed proteins, refers to a protein 
wherein at one or more positions there have been amino acid insertions, 

10 deletions, or substitutions, either conservative or non-conservative, provided 
that such changes result in a protein whose basic properties, for example 
enzymatic activity or receptor binding (type of and specific activity), 
thermostability, activity in a certain pH-range (pH-stabDily) have not 
significantly been changed. "Significantly" in this context means that one 

is skilled in the art would say that the properties of the variant may still be 
different but would not be unobvious over the ones of the original protein. 

By "conservative substitutions" is intended combinations such as Val, lie, 
Leu, Ala, Met; Asp, Glu; Asn, Gin; Ser, Thr, Gly, Ala; Lys, Arg, His; and 
20 Phe, Tyr, Trp. Preferred conservative substitutions include Gly, Ala; Val, He, 
Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 

A "varianf 9 typically has at least 25%, at least 50%, at least 60% or at least 
70%, preferably at least 80%, more preferably at least 90%, even more 
25 preferably at least 95%, yet more preferably at least 99%, mostijreferably at 
least 99.5% sequence identity to the polypeptide from which it is derived 

The percent sequence identity between two polypeptides may be determined 
using suitable computer programs, for example the GAP program of the 
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University of Wisconsin Genetic Computing Group and it will be 
appreciated ih&t percent identity is calculated in relation to polypeptides 
whose sequence has been aligned optimally. 



5 The alignment may alternatively be carried out using the Clustal W program 
(Thompson et ah, (1994) Nucleic Acids Res., 22(22), 4673-80). The 
parameters used may be as follows: 

• Fast pairwise alignment parameters: K-tuple(word) size; 1, window size; 
5, gap penally; 3, number of top diagonals; 5„ Scoring method: x 

10 percent 

* Multiple alignment parameters: gap open penalty; 10, gap extension 
penalty; 0.05. 

■ Scoring matrix: BLOSUM. 

15 Such variants may be natural or made using the methods of protein 
engineering and site-directed mutagenesis as are well known hi the art. 

A "fragment", in Hie context of the above-listed proteins, refers to a protein 
wherein at one or more positions there have been deletions. Thus the 

20 fragment may comprise at most 5, 10, 20, 30, 40 or 50% of the complete 
sequence of the full mature polypeptide. Typically a fragment comprises up 
to 60%, more typically up to 70%, preferably up to 80%, more preferably up 
to 90%, even more preferably up to 95%, yet more preferably up to 99% of 
the complete sequence of the full desired protein. Particularly preferred 

25 fragments of a protein comprise one or more whole domains of tb» protein. 

In one particularly preferred embodiment the heterologous protein 
comprises the sequence of albumin or a variant or fragment thereof. 



34 



23/12 '03 13:13 FAX 01159 552201 



ERIC POTTER CLARK SON 



@040 




By "albumin" we include a protein comprising the sequence of an albumin 
protein obtained from any source. Typically the source is mammalian. In 
one preferred embodiment the serum albumin is human serum albumin 
("USA"). The term "human serum albumin" includes the meaning of a 

5 serum . albumin having an amino acid sequence naturally occurring in 
hu mane and variants thereof. Preferably the albumin has the amino acid 
sequence disclosed in WO 90/13653 or a variant thereof. The HSA coding 
sequence is obtainable by known methods for isolating cDNA 
corresponding to human genes, and is also disclosed in, for example, EP 73 

10 646 and EP 286 424. 

In another preferred embodiment me "albumin" comprises the sequence of 
bovine serum albumin. The term 'Wine serum albumin" includes the 
meaning of a serum albumin having an amino acid sequence naturally 
15 occurring in cows, for example as taken from Swissprot accession number 
P02769, and variants thereof as defined below. The tenn "bovine serum 
albumin" also includes the meaning of fragments of full-length bovine 

serum albumin or variants thereof; as defined below. 

20 In anoihier preferred embodiment the albumin comprises the sequence of an 
albumin derived from one of serum albumin from dog (e.g. see Swissprot 
accession number P49822), pig (e.g. see Swissprot accession number 
P0S835), goat (e.g. as available from Sigma as product no. A2514 or 
A4164), turkey (e.g. see Swissprot accession number O73860), baboon (e.g. 

25 as available from Sigma as product no. A1516), cat (e.g. see Swissprot 
accession number P490S4), chicken (e.g. see Swissprot accession number 
P19121), ovalbumin (e.g. chicken ovalbumin) (e.g. see Swissprot accession 
number P01012), donkey (e.g. see Swissprot accession number F39090), 
guinea pig (e.g. as available from Sigma as product no. A3060, A2G39, 
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05483 or A6539), hamster (e.g. as available from Sigma as product no. ^ 
A5409), horse (e.g. see Swissprot accession number P35747), rhesus 
monkey (e.g. see Swissprot accession number Q28522), mouse (e.g. see 
Swissprot accession number 089020), pigeon (e.g. "as defined by Khan et 
al, 2002, Int. J. Biol. MacromoL, 30(3-4), 171-8), rabbit (e.g. see Swissprot 
accession number P49065), rat (e.g. see Swissprot accession number 
P36953) and sheep (e.g. see Swissprot accession number P 14639) and 
includes variants and fragments thereof as defined below. 

Many naturally occurring mutant forms of albumin are known. Many are 
described in Peters, (1996, Al! About Albumin: Biochemistry, Genetics and 
Medical Applications, Academic Press, Inc., San Diego, California, p. 170- 
181). A variant as defined above may be one of these naturally occurring 
mutants. 



A "variant albumin" refers to an albumin protein wherein at one or more 
positions there have been amino add insertions, deletions, or substitutions, 
either conservative or non-conservative, provided that such changes result in 
an albumin protein for which at least one basic property, for example binding 
activity (type of and specific activity e.g. binding to bilirubin), osmolality 
(oncotic pressure, colloid osmotic pressure), behaviour in a certain pH-range 
(pH-stability) lias not sigiiifieanlly been changed. "Significaiiuy' in this 
context means Ihat one skilled in the art would say that the properties of the 
variant may still be different but would not be unobvious over tbe ones of the 
25 original protein. 



By "conservative substitutions" is intended combinations such as Gly, Ala; 
Val, fie, Leu; Asp, Glu; Asn, Gin; Ser, Thr, Lys, Arg; and Phe, Tyr. Such 
variants may be made by techniques well known in the art, such as by site- 
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directed mutagenesis as disclosed in US Patent No 4,302,386 issued 24 
November 19S1 to Stevens., incorporated herein by reference. 

Typically an albumin variant will have more than 40%, usually at least 50%, 
5 more typically at least 60%, preferably at least 70%, more preferably at least 
80%, yet more preferably at least 90%, even more preferably at least 95%, 
most preferably at least 98% or more sequence identity with, naturally 
occurring albumin. The percent sequence identity between two polypeptides 
may be determined using suitable computer programs, for example the GAP 
10 program of the University of Wisconsin Genetic Computing Group and it 
will be appreciated that percent identity is calculated in relation to 
polypeptides whose sequence has been aligned optimally. The alignment 
may alternatively be carried out using the Clustal W program (Thompson et 
aL, 1994). The parameters used may be as follows: 



15 



20 



Fast pairwise alignment parameters: K-tuple(word) size; l s window size; 5, 
gap penalty; 3, number of top diagonals; 5, Scoring method: x percent- 
Multiple alignment parameters: gap open penalty; 10, gap extension 
nenaltv: 0.05. Scorins matrix: BLOSUM, 



The term "fragment" as used above includes any fragment of full-length 
albumin or a variant thereof, so long as at least one basic property, for 
example binding activity (type of and specific activity e.g. binding to 
bilirubin), osmolality (oncotic pressure, colloid osmotic pressure), behaviour 
25 in a certain pH-range (pH-stabitity) has not significantly btfen changed- 
"Significantly*' in this context means that one skilled in the art would say that 
the properties of (he variant may still be different but would not be unobvious 
over the ones of the original protein. A fragment will typically be at least 50 
amino acids long. A fragment may comprise at least one whole sub-domain 
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of albumin. Domains of HSA have been expressed as recombinant proteins 
(Dockal, M. etal., 1999, J. Biol. Ckem., 274, 29303-29310), where domain 
I was defined as consisting of amino acids 1-197, domain II was defined as 
consisting of amino acids 189-385 and domain HI was defined as consisting 
s of amino acids 381-585. Partial overlap of the domains occurs because of 
the extended a-helix structure (hlO-hl) which exists between domains I and 
II, and between domains H and III (Peters, 1996, op. tit., Table 2-4). HSA 
also comprises six sub-domains (sub-domains IA, IB, IIA, HB, JIIA and 
IIIB). Sub-domain IA comprises amino acids 6-105, sub-domain IB 

io comprises amino acids 120-177, sub-domain IIA comprises amino acids 
200-291, sub-domain IIB comprises amino acids 316-369, sub-domain IIIA 
comprises amino acids 392-491 and sub-domain EHB comprises amino 
acids 512-583. A fragment may comprise a whole or part of one or more 
domains or sub-domains as defined above, or any combination of those 

is domains and/or sub-domains. 



• 



In another particularly preferred embodiment the heterologous protein 
comprises the sequence of transferrin or a variant or fragment thereof. The 
term "transferrin" as used herein includes all members of the transferrin 

20 family (Testa, Proteins of iron metabolism, CRC Press, 2002; Harris & 
Aisen, Iron carriers and iron proteins, Vol 5, Physical Bioraorganic 
Chemistry, VCH, 1991) and their derivatives, such as transferrin, mutant 
transferrins (Mason ei al, 1993, Biochemistry, 32, 5472; Mason et al, 1998, 
Biockem. J., 330(1), 35), truncated transferrins, transferrin lobes (Mason et 

25 al, 1996, Protein Expr. Purify 8 ? 119; Mason et al 9 1991, Protein Expr, 
Purify 2, 214), lactoferrin, mutant lactofcrrins, truncated lactoferrins, 
lactofenin lobes or fusions of any of the above to other peptides, 
polypeptides or proteins (Shin et a!, 1995, Froc. Natl Acad. Set USA, 92, 
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2820; Ali ei al 7 1999, J. Biol Chem., 274, 24066; Mason et al, 2002, 
Biochemistry, 41, 9448). 

The transferrin may be human transferrin. The term "human iransferrifi" is 
used herein to denote material which is indistinguishable from transferrin 
derived from a human or which is a variant or fragment thereof. A 
'Variant" includes insertions, deletions and substitutions, either conservative 
or non-conservative, where such changes do not substantially alter the 
useful ligand-binding of immunogenic properties of transferrin. 



Mutants of transferrin are included in the invention. Such mutants may 
have altered immunognenicity. For example, transferrin mutants may 
display modified (e.g. reduced) glycosylate The N4inked glycosylate 
pattern of a transferrin molecule can be modified by addingfremoving 
15 amino acid glycosylation consensus sequences such as N-X-S/T, at any or 
all of the N 9 X, or S/T position. An example of a transferrin mutant 
modified in this manner is exemplified below. 

We also include naturally-occurring polymorphic variants of human 
20 transferrin or human transferrin anal ogues. Generally, variants or fragments 
of human transferrin will have at least 50% (preferably at least 80%, 90% or 
95%) of human transferrin's ligand binding activity (for example iron- 
binding), weight for weight. The iron binding activity of transferrin or a 
test sample can be determined spectrophotonietrically by 470tmr2$0nm 
25 absorbance ratios for the proteins in their iron-free and fully iron-loaded 
states- Reagents should be iron-free unless stated otherwise. Iron can be 
removed from transferrin or the test sample by dialysis against 0.1M citrate, 
0.1M acetate, 10mM EDTA pH4.5, Protein should be at approximately 
20mg/mL in lOOmM* HEPES, lOmM NaHC0 3 pHS-0- Measure the 
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47Gnm:280nm absorbance ratio of apo-traasferrin (Calbiochcm, CN 
Biosciences, Nottingham, UK) diluted in water so that absorbance at 280nm 
can be accurately determined spectrophotometrically (0% iron binding). 
Prepare 20mM iron-nitrilotriacetate (FeNTA) solution by dissolving 191mg 
5 nitrotriacetic acid in 2mL 1M NaOH, then add 2mL 0.5M ferric chloride. 
Dilute to 5GmL with deionised water. Fully load apo-transferrin with iron 
(100% iron binding) by adding a sufficient excess of freshly prepared 
20mM FeNTA, then dialyse the holo-transferrin preparation completely 
against 100mM HEPES, lOmM NaHC0 3 pHS.O to remove remaining 
10 FeNTA before measuring the absorbance ratio at 470mn:2S0nm. 

the procedure using test sample, which should initially be free from iron, 
and compare final ratios to the control. 

... ■'• Additionally, single or multiple heterologous fusions comprising any of tne 
IS above; or single or multiple heterologous fusions to albumin, transferrin or 
iramunoglobins or a variant or fragment of any of these may be used. Such 
fusions include albumin N-terminal fusions, albumin C-terminal fusions and 
co-N-terminal and C-terminal albumin fusions as exemplified by WO 
01/79271, and transferrin N-terminal fusions, transferrin C-terminal fusions, 
20 and co-N-terminal and C-terminal transferrin fusions. 

Examples of transferrin fusions are given in US patent applications 
US2003/0221201 and US2003/0226J 55, Shin, et al, 1995, Proc Natl Acad 
Sci USA, 92, 2820, Ah, et at., 1999, J Biol Chem, 274, 24066, Mason, et 
is al., 2002, Biochemistry, 41, 9448, the contents of which arc incoiporated 
herein by reference. 

The skilled person will also appreciate that the open reading frame of any 
other gene or variant, or part or either, can be utilised as an open reading 
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feme for use with the present invention. For example, the open reading 
frame may encode a protein comprising any sequence, be it a natural protein 
(including a zymogen), or a variant, or a fragment (which may, for example, 
he a domain) of a natural protein; or a totally synthetic protein; or a single 
5 or multiple fusion of different proteins (natural or synthetic). Such proteins 
can he taken, but not exclusively, from the lists provided in WO 01/79258, 
WO 01/79271, WO 01/79442, WO 01/79443, WO 01/79444 and WO 
01/79480, or a variant or fragment thereof; the disclosures of which are 
incorporated herein by reference. Although these patent applications 
10 present ihe list of proteins in the context of fusion partners for albumin, the 
present invention is not so limited and, for the purposes of the present 
invention, any of the proteins listed therein may be presented alone or as 
fusion partners for albumin, the Fc region of immunoglobulin, transferrin, 
lactqferrin or any other protein or fragment or variant of any of the above, 
15 as a desired polypeptide. 

The heterologous protein may be a therapeutically active protein. In other 
words, it may have a recognised medical effect on individuals, such as 
humans. Many different types of merapeutically active protein are well 
20 known in the art. 

The heterologous protein may comprise a leader sequence effective to cause 
secretion in yeast. 



25 



Numerous natural or artificial polypeptide signal sequences <a1so called 
secretion pre regions) have been used or developed for secreting proteins 
from host cells. The signal sequence directs the nascent protein towards the 
machinery of the cell that exports proteins from the cell into the 
surrounding medium or, in some cases, into the periplasmic space. The 
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10 



signal sequence is usually, although not necessarily, located at the N- 
terminus of the primary translation product and is generally, although not 
necessarily, cleaved off the protein during the secretion process, to yield the 
'■mature" protein. 

In the case of some proteins the entity that is initially secreted, after the 
removal of die signal sequence, includes additional amino acids at its N- 
terminus called a "pro" sequence, the intermediate entity being called a 
"pro-protein". These pro sequences may assist die final protein to fold and 
become functional, and are usually flien cleaved off. In other instances, the 
pro region simply provides a cleavage site for an enzyme to cleave off the 
pre-pro region and is not known to have another function. 

The pro sequence can be removed either during the secretion of the protein 
15 from the cell or after export from the cell into the surrounding medium or 
periplasmic space. 

Polypeptide sequences which direct the secretion of proteins, whether they 
resemble signal (i.e. pre) sequences or pre-pro secretion sequences, are 
20 referred to as leader sequences. The secretion of proteins is a dynamic 
process involving translation, translocation and post-translational 
processing, and one or more of these steps may not necessarily be 
completed before another is either initiated or completed. 



25 



For production of proteins in eukaryotic species such as the yeasts 
Saccharomyces cerevisiae, Zygosacckaromyces species, Kluyveramyces 
lactis and Pichia pastoris, known leader sequences include those from the 
S. cerevisiae acid phosphatase protein (Pho5p) (see BP 36'6" 400), die 
invertase protein (Suc2p) (see Smith et al. (1985) Science, 229, 1219-1224) 
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and heat-shock protein- 150 (HsplSOp) (see WO 95/33833), Additionally 
leader sequences from the S. cerevisiae mating factor alpha- 1 protein 
(MFct-1) and from the human lysozyme and human serum albumin (HSA) 
protein have been used, the latter having been nsed especially, although not 
5 exclusively, for secreting human albumin. WO 90/01063 discloses a fusion 
of the MFa-1 and HSA leader sequences, which advantageously reduces 
the production of a contaminating fragment of human albumin relative to 
the use of the MFa-1 leader sequence. Modified leader sequences are also 
disclosed in the examples of this application and the reader will appreciate 
10 that those leader sequences can be used with proteins other than transferrin. 

Where the chaperone is protein di sulphide isornerase, then preferably the 
heterologous protein comprises disulphide bonds in its mature form. The 
disuljphide bonds may be intramolecular and/or interrnolecular. 

15 

The heterologous protein may be a commercially useful protein- Some 
heterologously expressed proteins are intended to interact with the cell in 
which they are expressed in order to bring about a beneficial effect on the 
cell's activities. These proteins are not, in their own right, commercially 
20 useful, Commercially useful proteins are proteins that have a utility ex vivo 
of the cell in which they are expressed- Nevertheless, the skilled reader will 
appreciate that a commercially useful protein may also have a hioJogical 
effect on the host cell expressing it as a heterologous protein, but that that 
effect is not the main or sole reason for expressing the protein therein. 

25 

In one embodiment it is preferred that the heterologous protein is not p- 

lactamase- In another embodiment it is preferred that the heterologous 

protein is not antistasin. However, the reader will appreciate that neither of 

these provisos exclude genes encoding either p-laclamase or antistasin from 
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being present on the 2jjtm-fkmily plasmid of the invention, merely that the 
gene encoding the heterologous protein encodes a protein other than ft- 
lactamase and/or antistasin, 

5 Plasmids can be prepared by modifying 2jjm-family plasmids known in the 
art by inserting a gene encoding a chaperons and inserting a gene encoding a 
heterologous protein using techniques well known in the art such as are 
described in by Sambrook et aL, Molecular Cloning: A Laboratory Manual, 
2001, 3rd edition „ the contents of which are incorporated herein by 
10 reference. For example, one such method involves ligation via cohesive 
ends. Compatible cohesive ends can be generated on a DNA fragment for 
insertion and plasmid by the action of suitable restriction enzymes. These 
ends will rapidly anneal through complementary base pairing and remaining 
nicks can be closed by the action of DNA ligase. 

15 

A further method uses synthetic double stranded oligonucleotide linkers and 
adaptors. DNA fragments with blunt ends are generated by bacteriophage T4 
DNA polymerase or E.coli DNA polymerase I which remove protruding 3" 
termini and fill in recessed 3 * ends. Synthetic linkers and pieces of blunt- 

20 ended double-stranded DNA, which contain recognition sequences for defined 
restriction enzymes, can be ligated to blunt-ended DNA fragments by T4 
DNA Ilgasc. They arc subsequently digested with appropriate restriction 
enzymes to create cohesive ends and ligated to an expression vector with 
compatible termini. Adaptors are also chemically jsynthesised DNA 

25 fragments which contain one blunt end used for ligation but which also 
possess one preformed cohesive end. Alternatively a DNA fragment or DNA 
fragments can be ligated together by the action of DNA ligase in the presence 
or absence of one or more synthetic double stranded oligonucleotides 
optionally containing cohesive ends. 
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Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including Sigma-Genosys 
Ltd, London Road, Pampisford, Cambridge, United Kingdom. 

5 

Appropriate insertion sites in 2jim-family plasmids include, but are not 
limited to, those discussed above. 

The present invention also provides a host cell comprising a plasmid as 
10 defined above. The host cell may be any type of eel]. Bacterial and yeast 
host cells are preferred. Bacterial host cells may be useful for cloning 
puiposes. Yeast host cells may be useful for expression of genes present in 
the plasmid. 

15 In one embodiment the host cell is a yeast cell, such as a member of the 
Saccharomyces, Kluyveromyces, or Pickia genus, such Saccharomyces 
cerevisiae, Kluyveromyces lactis, Pickia pa$tari$ and Pichia 
membranaefaciens, or Zygosaccharomyc&s roindi z Zygosaccharomyc&s 
bailiU Zygosaccharomyces fermenlati, or Kluyveromyces drospkilarum are 

20 preferred. 

The host cell type may be selected for compatibility "with the plasmid type 
being used. Plasmids obtained from one yeast type can be maintained, in 
other yeast types (Irie et ah 1991, Gene, 108(1), 139-144; Irie et a! y 1991, 
25 MoL Gen. Genet, 225(2), 257^265). For example, -pSRl from 
Zygosaccharomyces rouxii can be maintained, in Saccharomyces cerevisiae. 
Preferably, the host cell is compatible with the 2j.Lm-family plasmid used 
(see below for a foil description of the following plasmids). For example, 
where the plasmid is based on pSRl, pSB3 or pSB4 then a suitable yeast 
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cell is Zygosaccharomyces rouxii; where the plasmid is based on pSB 1 or 
pSB2 then a suitable yeast cell is Zygosaccharomyces bailli; where the 

plasmid is based on pSMl then a suitable yeast cell is Zygosaccharomyces 
fermentatt; where the plasmid is based on pKDl then a suitable yeast cell is 
5 Kluyveromyces drosophilarum- where the plasmid is based on pPMl then a 
suitable yeast cell is Pichia membranaefaciens; where the plasmid is based 
on the 2|j,m plasmid then a suitable yeast cell is Sacckaromyces cerevisiae 
or Sacckaromyces carlsbergensis. It is particularly preferred that the 
plasmid is based on the 2p.m plasmid and the yeast cell is Sacckaromyces 
10 cerevisiae. 
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A 2uMi-family plasmid of the invention can be said to be "based on" a 
naturally occurring plasmid if it comprises one, two or preferably three of 
the genes FLP, REP J and RBP2 having sequences derived from that 
15 naturally occurring plasmid. 



It may be particularly advantageous to use a yeast deficient in one or more 
protein mannosyl transferases involved in O-glycosylation of proteins, for 
instance by disruption of the gene coding sequence. 

20 

Recombinantly expressed proteins can be subject to undesirable post- 
translational modifications by the producing host cell. For example, the 
albumin protein sequence docs not contain any sites for N-linfced 
glycosylation and has not been reported to be modified, in nature, by O- 
25 linked glycosylation. However, it has been found that recombinant human 
albumin ("rHA") produced In a number of yeast species can be modified by 
O-linked glycosylation, generally involving matinose. The mannosylated 
albumin is able to bind to the lectin Concanavalin A. The amount of 
mannosylated albumin produced by the yeast can be reduced by using a 
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yeast strain deficient in one or mote of the PMT genes (WO 94/04687). 
* The most convenient way of achieving tbis is to create a yeast which has a 

defect in its genome such that a reduced level of one of the Pmt proteins is 
produced. For example, there may be a deletion, insertion or transposition 
5 in the coding sequence or the regulatory regions (or in another gene 
regulating the expression of one of the PMT genes) such that little or no 
Pmt protein is produced. Alternatively, the yeast could be transformed to 
produce an anti-Pint agent, such as an anti-Pmt antibody. 



10 If a yeast other than S. cerevisiae is used, disruption ot one or more or me 
genes equivalent to the PMT genes of 5. cerevisiae is also beneficial, e.g. in 
Pichia pastoris or Kluyveromyccs lactis. The sequence of PMTl (or any 
other PMT gene) isolated from S. cerevisiae may be used for the 
identification or disruption of genes encoding similar enzymatic. activities in 

15 other fungal species. The cloning of the PMTl homologue of 
Kluyveromyces lactis is described in WO 94/04687. 

The yeast will advantageously have a deletion of the HSP150 and/or YAP3 
genes as taught respectively in WO 95/33 833 and WO 95/23857. 



20 



A plasmid as defined above, may be introduced into a host through standard 
techniques. With regard to transformation of prokaryotic host cells, see, for 
example, Cohen ai al (1972) Proa Nail. Acad. Set USA 69, 2110 and 
Sambroolc et al (2001) Molecular Cloning. A Laboratory Manual, 3 T Ed. 
25 Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. Transformation of 
yeast cells is described in Sherman et al (1986) Methods In Yeast Genetics, A 
Laboratory? Manual, Cold Spring Harbor, NY. The method of Beggs (197S) 
Nature 275, 104-109 is also useful Methods for die transformation of S. 
cerevisiae are taught generally in EP 251 744, EP 25S 067 and WO 
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vu/u woJ, all ot which are incorporated herein by reference. With regard to m 
vertebrate cells, reagents useful in transfecting such cells, for example calcium ^ 
phosphate and DEAE^dextmn or liposome formulations, are available from 
Stratagene Cloning Systems, or Life Technologies Inc., Gaitoersburg, MD 
5 ■ 20877, USA. 



Electroporation 



art for transforming yeast cell, bacterial cells and vertebrate cells. Methods 
for transformation of yeast by eleciroporation are disclosed in Becker & 
10 Guarente (1 990) Methods Enzymot, 194, 1 82. 

Generally, the plasm id will transform not all of the hosts and it will therefore 
be necessary to select for transformed host cells. Thus, a plasmid may 
v comprise a selectable marker, including but not limited to bacterial 
15 selectable marker and/or a yeast selectable marker. A typical bacterial 
selectable marker is the p-lactamase gene although many others are known 
in the art. Typical yeast selectable marker include LEXJ2, TRP1, HIS3, 
HIS4, UEA3. URA5, SFAI, ADE2, METIS, LYS5, LYS2. JLV2, FBAl and 
PGKL Those skilled in the art will appreciate that any gene whose 

20 chromosomal deletion or inactivation results in an inviable host, so called 
essential genes, can be used as a selective marker if a functional gene is 
provided on the plasnud, as demonstrated for PGK1 in a pgkl yeast strain 
(Piper and Ciirran, 1990, Curr. Genet. 17, 119). Suitable essential gen.es 
can be found within the Stanford Genome Database (SGD), 

25 bttp: ://db .yeastgenome. org). * 

Additionally, a plasmid according to any one of the first, second or third 
aspects of the present invention may comprise more than one selectable 
marker. 
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One selection technique involves incorporating into the expression vector a 
DNA sequence marker, with any necessary control elements, that codes for a 
selectable trait in the transformed cell. These markers include dihydrofolate 
reductase, G418 or neomycin resistance for eufcaryotic cell culture, and 
tetracyclic kanamycin or ampicillin (i.e. p-lactamase) resistance genes for 
mlruring in E.coli and other bacteria. Alternatively, the gene for such 
selectable trait can be on another vector, which is used to co-transform the 
desired host cell. 



Another method of identifying successfully transformed colls involves 
growing the cells resulting from the introduction of a plasmid of the invention, 
optionally to allow the expression of a recombinant polypeptide (i.e. a 
polypeptide which is encoded by a polynucleotide sequence on Hie plasmid 

15 and is heterologous to the host cell, in the sense that that polypeptide is not 
naturally produced by the host). Cells can be harvested and lysed and their 
DNA or RNA content examined for the presence of the recombinant sequence 
using a method such as that described by Southern (1975) J. Mot. Biol 9S, 
503 or Berent er al (1985) Biotech. 3, 208 or other methods of DNA and RNA 

20 analysis common in the art Alternatively, the presence of a polypeptide in 
Hie supernatant of a culture of a transformed cell can be detected using 
antibodies. 

Li addition to directly assaying for the presence of recombinant DNA, 
25 successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression of 
Hie protein. For example, cells successfully transformed with an expression 
vector produce proteins displaying appropriate antigenicity. Samples of cells 
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suspected of being transformed are harvested and assayed for the protein 
using suitable antibodies. 

Thus, in addition to the teansfoEmed host cells themselves, the present 
5 invention also contemplates a culture of those cells, preferably a monoclonal 
(clonally homogeneous) culture, or a culture derived from a monoclonal 
culture, in a nutrient medium. 





pp 


11 


Iff 



10 appropriate conditions known to those skilled in the art, and in view of the 
teachings disclosed herein, to permit the egression of the chaperone and 
heterologous protein encoded by ihe plasmid. 

The culture medium may be non-selective or place a selective pressure on 
is the maintenance of the plasmid. 



19 055 



The thus produced heterologous protein may be present intracellularly or, if 
secreted, in the culture medium and/or periplasmic space of the host cell. 

The step of '^purifying the thus expressed heterologous protein from the 
cultured host cell or the culture medium" optionally comprises ceJU 
immobilization, cell separation and/or cell breakage, but always comprises 
at least one other purification step different from the step or steps of cell 
immobilization, separation and/or breakage. 

t 

Cell immobilization techniques, such as encasing the cells using calcium 
alginate bead, are well known in the art. Similarly, cell separation 
techniques, such as centrifugation, filtration (e.g. cross-flow filtration, 
expanded bed chromatography and the like are well known in the art. 

50 



.23/12 '03 13;18 FAX 01159 552201 



ERIC POTTER CLARKSON 



111056 




Likewise methods of cell breakage, including beadmilling, sonication, 
enzymatic exposure and the like are well known in the art 

The at least one other purification step may be any other step suitahle for 
5 protein purification known in the art. Fox example purification techniques 
for the recovery of recombinantly expressed albumin have been disclosed 
in: WO 92/04367, removal of matrix-derived dye; EP 464 590, removal of 
yeast-derived colorants; EP 319 067, alkaline precipitation and subsequent 
application of me albumin to a lipophilic phase; and WO 96/37515, US 5 
72S 553 and WO 00/44772, which describe complete purification 
processes; all of which are incorporated herein by reference. 



10 



Proteins other than albumin may be purified from Hie culture medium by 
any technique that has been found to be useful for purifying such proteins. 

Suitable methods include ammonium sulphate or ethanol precipitation, acid or 
solvent extraction, anion or cation exchange chromatography, 

phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography, lectin 
20 chromatography, concentration, dUution, pM adjustment, dianllration, 

ultrafiltration, high perfotmance liquid chromatography ("I-IPLC"), reverse 

phase HPLC, conductivity adjustment and the like. 

Thus, the heterologous protein can be separated from contaminating 
25 molecules and/or concentrated. It is preferred that the purified -heterologous 
protein is provided at a concentration of at least 0.03 gX" 1 , 0.04 gX , 0.05 
g.I/\0.06 gX-',0.07 g.I/\ 0.08 g-L" 1 , 0.09 gl" 1 , 0-1 gX 1 , 0.2 g.L' 1 , 0.3 g.L" 1 , 
0.4 g-L' 1 , 0.5 gX" 1 , 0-6 g-L" 1 , 0.7 gX 1 , 0.8 gX l , 0.9 gX \ I gX 1 , 2 gX\ 3 
gX 1 , 4 g-L' 1 , 5 gX 1 , 6 g.X l , 7 gX 1 , 8 gX"', 9 gX' 1 , 10 gX'\ 15 gX 1 , 20 gX 
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10 



20 



\ 25 gX" 1 , 30 gX 1 , 40 gX',50 gX" 1 , 60 gi 1 , 70 gX"', 70 gX"', 90 gX"', 100 

gX 1 , 150 gX 1 , 200 gX',250 gX 1 , 300 g.V\ 350 g.X\ 400 gX 1 , 500 gX'\ ^ 

600 gX 1 , 700 gX" 1 , 800 gX 1 , 900 g.X l , 1000 gX" 1 , ormore. 
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It is preferred that the heterologous protein is purified to achieve a 
pharmaceutically acceptable level of purity. A protein has a 
pharmaceutical^ acceptable level of purity is it is essentially pyrogen free 
and can be administered in a pharmaceutically efficacious amount without 
causing medical effects not associated with the activity of the protein. 

The resulting heterologous protein may be used for any of its known 
utilities, which, in the case of albumin, include i.v. administration to 
patients to treat severe burns, shock and blood loss, supplementing culture 
media, and as an excipient in formulations of other proteins. 

Although it is possible for a fherapeuticaUy useful heterologous protein 
obtained by a process of the of the invention to be administered alone, it is 
preferable to present it as a pharmaceutical formulation, together with one or 
more acceptable carriers or diluents. The carrier(s) or diraent(s) must be 
"acceptable" in the sense of being compatible with the desired protein and not 
deleterious to the recipients thereof. Typically, the carders or diluents will be 
water or saline which will be sterile and pyrogen free. 

Optionally the thus formulated protein will be presented in a unit dosage 
25 form, such as in the form of a tablet, capsule, injectable solution or the like. 

A further embodiment of the present invention provides a host cell 
recombinantly encoding proteins comprising the sequences of PDI and 
transferrin-based proteins. By "transferrin-based protein" we mean 
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transferrin or any other member of the transferrin family (e.g. lactoferrin), a 
W variant or fragment thereof or a fusion protein comprising transferrin, a 

variant or fragment thereof, including the types described above. Thus the 
present invention also provides for the use of a recombinant PDI gene to 
5 increase the expression of a tmusferrm-based protein. 

The PDI gene may be provided on a plasmid, such as a 2Mm-family plasmid 
as described above. Alternatively, the PDI gene may be ehromosomally 
integrated. In a preferred embodiment, the PDI gene is ehromosomally 
10 integrated at the locus of an endogenous!/ encoded PDI gene, preferably 
without disrupting the expression of the endogenous PDI gene. 

The gene encoding the transferrin-based protein may be provided on a 
plasmid, such as a 2ura-family plasmid as* described above, or may be 
1 s ehromosomally integrated, such as at the locus of an endogenously encoded 
PDI gene, preferably without disrupting the expression of the endogenous 
PDI gene. 

In one embodiment the PDI gene is ehromosomally integrated and the gene 
20 encoding the transferrin-based protein is provided on a plasmid. In another 
embodiment, the PDI gene is provided on a plasmid and the gene encoding 
the transferrin-based protein is ehromosomally integrated. In another 
embodiment both the PDI gene and the gene encoding the transferrin-based 
protein are ehromosomally integrated. In another embodiment both the PDI 
is gene and the gene encoding Ibe transferrin-based protein are provided on a 
plasmid. 

Ajs is apparent from the examples of the present application, the 
combination of recombinant^ expressed PDI and transferrin-based proteins 
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provides a surprisingly high level of transferrin expression. For example, 
transferrin expression in a system that includes a chromosomally encoded 
recombinant PDI gene provided a 2-fold increase (compared to a control in 
which there is no chromosomally encoded recombinant PDI gene). This 
5 increase was 5-thnes greater than an equivalent system comprising a 
recombinant gene encoding human albumin in place of the recombinant 
transferrin gene. 

The host may be any cell type, such as a prokaryotic cell (e.g. bacterial cells 
10 such as B. coli") or a eukaryotic cell Preferred eukaryotic cells include 
fungal cells, such as yeast cells, and mammalian cells. Exemplary yeast 
cells are discussed above. Exemplary mammalian ceils include human 
cells. 



• 



is Host cells as described above can be cultured to produce recombinant 
transferrin-based proteins. The thus produced fransferrm-based proteins can 
be isolated from the culture and purified, preferably to a pharmaceutical^ 
acceptable level of purity, for example using techniques known in the art 
and/or as set out above. Purified transferrin-based proteins may be 
20 formulated with a pharmaceutical^ acceptabl e carri er or diluent and m ay be 
presented in unit dosage form. 

The present invention will now be exemplified with reference to the 
following non-h'mitmg examples and figures. 

25 
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BRIEF DESCRIPTION OF THE FIGURES 

Figures 1, 2, 4, 6 to 15, 22 and 25 show various plasmid maps. 
5 Figure 3 shows plasmid insertion sites. 

Figure 5 shows a restriction map of a DNA fragment containing the PDI 
coding sequence. 



to 



Figure 16 shows the results of rocket immunoclectrophoresis (RIE) 
determination of increased recombinant transferrin (N413Q, N611Q) 
secretion with PDU over-expression. Cryopreserved yeast stocks were 
grown for 4-days in lOmL BMMD shake flask cultures and supernatants 
were loaded at 5uL per well. Goat polyclonal anti-transfemn (human) 
15 antiserum (Calbiochem) was used at 40uL per rocket 
innnunoelectrophoresis gel (50mL). A = Control strain [pSAC35] s 
duplicate flasks; B = Control strain [pDB2536], duplicate flasks; C = 
Control strain [pDB2711], neat to 40-fold aqueous dilutions; D = Control 
strain [pDB2931], duplicate flasks; E - Control strain [pDB2929], neat to 
20 40-fold aqueous dilutions. 

Figure 17 shows the results of RIE analysis of recombinant transferrin 
(N413Q, N611Q) secretion with and without PDI1 over-expression, 
Cryopreserved yeast stocks were grown for 4-days in lOmL BMMD shake 
25 flask cultures and supernatants were loaded at 5uL per welK. Duplicate 
loadings were made of supernatants from two individual cultures of each 
strain. Goat polyclonal anti-transferrin (human) antiserum (Calbiochem) 
was used at 40uL per rocket Immunoelectrophoresis gel (50mL). A = 
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Control strain [pSAC35]; B = Control strain fpDB253fi] ; C = Control strain 
[pDB271 1]; D - Control strain [pDB293 1]; E = Control strain [pDB2S>29]. 

Figure 18 shows the results of SDS-PAGE analysis of recombinant 
5 transferrin secretion with and without PDI1 over-expression. BMMD shake 
flask cultures were grown for 4-days and 10uL supernatant analysed on 
non-reducing SDS-PAGE (4-12% NuPAGE®, MOPS buffer, mVitrogen) 
with GelCode® Blue reagent (Pierce). SeeBlue Plus2 Markers (InVitrogen). 
1 - pDB2S36; 2 = pDB2536; 3 = pDB271 1; 4 - pDB271 I; 5 = P DB293 1; 6 
10 = P DB293 1; 7 - pDB2929; 8 = pDB2929; 9 = pSAC35 control. 

Figure 19 shows RIE analysis of recombinant transferrin secretion from S. 
cerevisiae strains with an additional integrated copy of PDI1. 5-day BMMD 
shake flask culture supematants were loaded at 5niL per well. Strains 

15 contained; 1) pSAC3S (negative control); 2) pDB253<5 (recombinant non- 
glycosylated transferrin (N413Q, N611Q)) or 3) pDB2506 (same as plasmid 
pDB2536 but the transferrin ORP encodes transferrin without the N-KJ 
mutations at positions 413 and 611, i.e, recombinant glycosylated 
transferrin). Each well contained a sample derived from an individual 

20 transformant Standards were human plasma holo-transferrin (Calbiochem) 
at .100, 50, 20.,. 10, 5 and 2mg.L I . 



Figure 20 shows RIE analysis of recombinant transferrin secretion from 
Strain A [pDB2536] and Strain A (pDB2506J grown in shake flask culture. 
25 5-day BMMD shake flask culture supematants were loaded in duplicate at 
5mL per well. 

Figure 21 shows SDS-PAGE analysis of recombinant transferrin secreted 
from Strain A |pDB2536] and Strain A [pDB2506] grown in shake flask 
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culture. Cultures were grown for 5-days in BMMD and 30mL supernatants 
analysed on SDS-PAGB (4-12% NuPAGE™, MOPS Buffer, InVitrogen) 
stained with GelCode, Blue Reagent (Pierce). 1) Strain A [pDB2536] 
transformant 1; 2) Strain A [pDB2536] transformant 2; 3) Strain A 
5 [pSAC35] control; 4) Strain A [pDB2506] transformant 1; 5) SeeBlue, 
Plus2 Protein Standards (approximate molecular weights only). 



Figure 23 shows RTE of recombinant transferrin secreted from S. cerevisiae 
strains with different PDI1 copy numbers. 3-day BMMD shake flask 
10 culture supernatants were loaded at 5mL per well. Goat polyclonal anti- 
transferrin (human) antiserum (Calbiochem) was used at 30mL per rocket 
irnmunoelectropboresis gel (50mL). (A) supernatant from S. cerevisiae 
control strain [pDB2711] or [pDB2712l; (B) supernatant from Strain A 
[pDB253ti]; (C) supernatant from control strain |pDB253 6). 

15 

Figure 24 shows SDS-PAGE analysis of recombinant transferrin secreted 
from 5. cerevisiae strains with different PDI1 copy numbers. 4-12% 
NuPAGE reducing gel run with MOPS buffer (InVitrogen) after loading 
with 30mL of 3-day BMMD shake flask culture supernatant per lane; (lane 
20 1) supernatant from control strain [pDB2536]; (lane 2) supernatant from 
Strain A [pDB2536]; (lanes 3-6) supernatant from control strain 
[pDB2711] or £pDB2712]; (lane 7) molecular weight markers (SeeBlue 
Plus2, InVitrogen). 

25 EXAMPLES 

Two types of expression cassette have been used to secrete a recombinant 
. human transferrin mutant (N413Q, N611Q) from 6'. cerevisiae. One type 
uses a modified HSA(pre)/MFal(pro) leader sequence (named the 
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"modified fusion leader" sequence). The second type of expression cassette 
uses only tke modified H£A(pre) leader sequence. 

The 24 amino acid sequence of the "modified fusion leader" is 
5 MKWVFXVSILFLFSSAYSRSLDKR. 

The 18 amino acid sequence of the modified HSA(pre) leader sequence is 
MKWWIVSILFLFSSAYS. 

10 Transferrin (N413Q, N61 1Q) expression using these two cassettes has been 
studied in S. cerevisiae using the 2um expression vector with and without 
an additional copy of the S. cerevisiae PDI gene, PDII. 

Example 1: Constr uction of expression plasmids 

15 

A 52-bp linker made by annealing 0.5mM solutions of oligonucleotides 
CF86 and CF87 (sec below) was introduced into the US-region of the 2pm 
plasmid pSAC35 at the ^cwil-sites in the 599-bp inverted repeats. One 
Xcml-site cuts 51 -bp after the REP2 translation termination codon, whereas 
20 the other XcmL-site cuts 127-bp before the end of the FLP coding sequence, 
due to overlap with the inverted repeat (see Figure 3). This DNA linker 
contained a core region "SnaBI-PacI-FseVSffl-Sma\-SnaBT\ which 
encoded restriction sites absent frompSAC35. 

25 Xcml Linker rCFS6+CFS7^ 

SfiT 

Pad SnaBI 

30 SnaBI Fsel SwaT 

CFB6 GGAGTGG-TA CGTATTAATT AAGGCCGGCC AGGCCCGGGT ACGTACCAAT TGA 
CF87 TCCTCACCAT GCATAATTAA TTCCGGCCGG TCCGGGCCCA T<3 C ATGGTT A. /VC 

5S 
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Plasmid pSAC35 was partially digested with. J&ml, the linear 11-kb 
fragment was isolated from a 0.7%(w/v) agarose gel, ligated with the 
CF86/CFS7 Xctnl linker (neat, 10" 1 and 10" 2 dilutions) and transformed into 
E. coli DHSct- Arnpicillin resistant transformants were selected and 
screened for the presence of plasmids that could he linearised by Smal 
digestion. Restriction enzyme analysis identified pDB26S8 (Figure 4) wrfh 
die linker cloned into the Xewl-site after KEP2. DNA sequencing using 
oligonucleotides primers CF88, CF9S and CF99 (Table 1) confirmed me 
insertion contained the correct linker sequence. 



Table 1 



15 



Oligonucleotide sequencing primers: 



Primer 



Description 



CF88 REP2 primer, 20mer 



Sequence 



5 J -ATCACGTAATACTTCTAGGG-3 ' 



CF98 BEP2 primer, 20mer 5'-AGAGTGAGTTGGAaGGaaGo-3' 



CF99 REP2 primer, 20mer 



5'-AQCTCGTAAGCGTCGTTACC-3' 



The yeast strain was transformed to leucine prototrophy using a modified 
lithium acetate method (Sigma yeast transformation kit, YEAST- 1 „ protocol 
2; (Ito et cd, 1983, J. Bacterial., 133, 163; Bible, 1992, Biatechniques, 13, 
20 IS)). Transformants were selected on BMMD-agar plates, and were 

subsequently patched Oat on BMMD-agar plates. CryopTeserved trehalose 

stocks were prepared from lOmL BMMD shake flask cultures (24 hrs, 
30°C, 200rpm), 
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The composition of YEJPD and BMMD is described by Sleep et at., 2002, 
Yeast, 18, 403. YEPS and BMMS are similar in composition to YEPD and 
BMMD accept that 2% (w/v) sucrose was substituted for the 2% (w/v) 
5 glucose as the sole initial carbon source. 



• 



The S. cerevisiae PDII gene was cloned into the Xcml-Imker of pDB2688. 
The PDII gene (Figure 5) was cloned on a 1.9-kb Sacl-Spel fragment from 
a larger £ cerevisiae genomic DNA fragment containing the PDII gene, 

10 which had been cloned into YIplac211 (Gietz & Sugino, 1988, Gene, 74, 
527-534), which had been treated with T4 DNA polymerase to fill the Spel 
5' -overhang and remove the Sad S'-overhang. This PDII fragment 
included 212-bp of the PDII promoter upstream of the translation initiation 
codon, and 148-bp downstream of the translation termination codon. This 

15 was ligated with Smal linearised/calf intestinal alkaline phosphatase treated 
pDB26"88, to create plasmid pDB2690 (Figure 6% with the PDII gene 
transcribed in the same direction as REP 2. A S. cerevisiae strain was 
transformed to leucine prototrophy with pDB2690. 

20 An expression cassette for a human transferrin mutant (N413Q, N611Q) 
was subsequently cloned into the jSfcjff-site of pDB2690 to create pDB2711 
(Figure 7). The expression cassette in pDB27ll contains the S. cerevisiae 
PUB I promoter, an HSA/MFct fusion leader sequence (EP 387319; Sleep et 
al, 1990, Biotechnology (JSt. Y.), 8, 42) followed by a coding sequence for the 

25 human transferrin mutant (N413Q, N611Q) and the S. cerevisiae ADHI 
terminator. Plasmid pDB2536 was constructed similarly by insertion of the 
same expression cassette into the Mrtl-site of pSAC35. 
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The "modified fusion leader" sequence used in pDB2536 and pDB2711 
comprises a modified HSA-pre sequence and a MFtxl -pro sequence. An 
alternative leader sequence Tised was the modijaed HSA-pre sequence, 
which was derived from the modified fusion leader sequence by removal of 
the six residues of the MFotl-pro sequence. 

The modified fusion leader sequence in pDB2515 (Figure 8) was mutated 
with oligonucleotides CP 154 and CF155 to delete the coding sequence for 
the six residues (RSLDKR) of the MFcd-pro region- This was performed 
10 according to the instruction manual of the Statagene's QnickChange™ Site- 
Directed Mutagenesis Kit pDB2515 is the E. coli cloning vector pGEM- 
7Z(-) (Promega) containing the 2940-bp Notl-RmtiHl (partial) DNA 
fragment of pDB2529 (see below) ligated between the PspQWR and Hindm 
sites, 

15 

CF154 

5 ' -GTTCTTGTTCTCCTCTGCTTACTCTGTCCCTQATAAAAGTGTGAGA 
TGG-3 f 




20 CF155 

5 ' - CC AT CT C AC AGTTTT AT CAGGGACAG AGTAAG C AGAGG AG AACAA 
GAAC - 3 ' 

Competent E. coli DHSa cells were transformed with the mutated plasraids 
25 and ampicillin resistant colonies were selected. Plasmid DNA from these 
colonies was screened by double digestion with EcoSl and BgKL The 
correct DNA sequence for fhe modified HSA-pre leader was subsequently 
confirmed in pDB2921 (Figure 9) over a 386-bp region between the AflU. 
and Hamlil sites cither side of the leader sequence. This 386-bp Aflll- 
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BamHL fragment was isolated,, and ligated with a 6 ? 081-bp Aflll-BatnHI 
fragment from pDB2529 (Figure 10), prepared by partial digestion with 
BamHL and complete digestion with Aflll and calf intestinal alkaline 
phosphatase. P DB2529 is the E. coli cloning vector pBST(+) (Sleep et al, 
2001, Yeast, 18, 403-441) containing the transferrin expression cassette of 
pDB2536 cloned into the unique Afofl-site. This produced pDB2928 
(Figure 1 1), which was isolated from ampicillin resistant J?, coli DH5ot cells 
transformed with the ligation products. 



10 The 3,256-bp NotI expression cassette was isolated from pDB2928. This 

a 

contained the PRBJ promoter, the coding region for the modified HSA-pre 
leader sequence followed by transfeixin (N413Q, N611Q), and the ADH1 
terminator. This was ligated into the Noil sites of the 2jim-based vectors 
pSAC35 and pDB2690 to generate the expression ; plasmids pDB2929 9 
15 pDB2930 5 pDB2931 and pDB2932 (Figures 12-15). ,In P DB2929 and 
pDB293l the transferrin (N413Q, N611Q) sequence is transcribed in the 
same direction as LEU2 7 whereas inpDB2930 and pDB2932 transcription is 
m the opposite direction. 



20 Example 2: Expression of transferrin 



A iS. cerevisiae control strain was transformed to leucine prototrophy with 
all the transferrin (N413Q 3 N61 1Q) expression cassettes, and cryopregerved 
stocks were prepared. 

Strains were grown for four days at 30°C in lOmL BMMD cultures in 
50mL conical flasks shaken at 200ipm. The titles of recombinant 
transferrin secreted into the culture supernatants were compared by rocket 
immunoelectrophoresis (RIE), reverse phase high performance liquid 
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chromatography (RP-HPLC) (Table 2), and non-reducing SDS 
polyaciylamide electrophoresis stained with colloidal Coomassie blue stain 
(SDS-PAGE). The increase its recombinant transferrin secreted when SL 
C&r&visiae PDI1 was over-expressed was estimated to be greater than 10- 
5 fold 



Table 2: 




10 RIE analysis indicated that the increased transferrin secretion in the 
presence of additional copies of PDI1 was approximately 15-fold (Figure 
16). By RIE analysis the increase appeared slightly larger for the modified 
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HSA-pre leader sequence than for the modified fusion leader sequence 

(Figure 17). £ 

By KP-HPLC analysis the increase in transferrin secretion was determined 
5 to be 18-fold for the modified fusion leader sequence and 15-fold for the 
modified HSA-pre leader sequence (Table 2). 

Figure 18 shows an SDS-PAGE comparison of the recombinant transferrin 
secreted by & cerevisiae strains with and without additional PDI1 
10 expression. 



KP-HPLC Method for D etermining Transferrin Expression 
Column: 50 * 4.6mm Phenomenex Jupiter C4 300A, 5\wx 
Column temperature: 45°C 
15 Flow rate: ImL-nrin" 1 

Peak deteotionrUV absorbance at 214mn 
HPLC mobile phase A: 0. 1% TFA, 5% Acetonitrile 
HPLC mobile phase B: 0. 1% TFA, 95% Acetonitrile 
Gradient: 0 to 3 minutes 30% B 

20 3 to 1 3 minutes 30 to 55% B in a linear gradient 

13 to 14 minutes 55% B 

14 to 15 minutes 55 to 30% B in a linear gradient 

15 to 20 minutes 30% B 

Injection: Generally 1 OO^L of sample, but any volume can be inj ected 
25 Standard Curve: 0. 1 to 1 Oug of human transferrin injected vb peak area 
Standard curve used for the results shown was linear up to 10p,g. 

y = 530888.x + 10526.7 
where y = peak area, and x = amount in pg. 
(r 2 ): 0.999953, where Correlation Coefficient - r 
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Example 3: ( 



S. cerevisiae Strain A was selected to investigate the secretion of 
recombinant glycosylated transferrin expression from plasmid pDB250<5 
and recombinant non-glyco sy lated transferrin (N413Q, N611Q) from 
plasmid pDB2536. Strain A has the following characteristics — 



• additional chxomosomally integrated PDI1 gene integrated at the 
10 host PDIJ chromosomal location. 

• the URA3 gene and bacterial DNA sequences containing the 
ampicillin resistance gene were also integrated into the S. cerevisiae 
gfcnome at the insertion sites for the above genes. 



A control strain had none of the above insertions. 



Control strain [cir°] and Strain A [cir°] were transformed to leucine 
prototrophy with pDB2506 (recombinant transferrin);, pDB2536 
20 (recombinant non-glycosylated transferrin (N413Q 3 N611Q)) or pSAC35 
(control). Transformants were selected on BMMD-agar, 



The relative level of transferrin secretion in BMMD shake flask culture was 
determined for each strain/plasmid combination by rocket 
25 Immunoelectrophoresis (RIE). Figure 19 shows that both strains secreted 
both the glycosylated and non-glycosylated recombinant transferrins into 
the culture supernatant 
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The levels of both the glycosylated and non-glycosylafed transferrins 
secreted from Strain A [pDB2506] and Strain A [pt>B2536] respectively, 
appeared higher than the levels secreted from the control strain. Hence, at 
least in shake flask culture, PDI1 integrated into the host genome at the 
5 PDI1 locus in Strain A has enhanced transferrin secretion. 



Furthermore, the increase in transferrin secretion observed between control 
strain [pDB2536] and Strain A [pDB2536] appeared to be at least a 100% 
increase by RIE. In contrast, the increase in rHA monomer secretion 

10 between control strain [pDB2305] and Strain A [pDB2305] was 
approximately 20% (data not shown). Therefore, the increase in transferrin 
secretion due to the additional copy of PDI1 in Strain A was surprising 
large considering that transferrin has 19 disulphide bonds, compared to rHA 
with 17 disulphide bonds. Additional copies of the PDTJ gene may be 

15 particularly beneficial for the secretion from S. cerevisiae of proteins from 
the transferrin family, and their derivatives. 

The levels of transferrin secreted from Strain A [pDB2536] and Strain A 
[pDB2506] were compared by RIE for transformants grown in BMMD and 

20 YEPD (Figure 20). Results indicated that a greater than 2-fold increase in 
litres of both non-glycosylated recombinant transferrin (N413Q, N611Q) 
and glycosylated recombinant transferrin was achieved by growth in YEPD 
(10-20 mg.L 1 serum transferrin equivalent) compared to BMMD (2-5 
mgXT 1 serum transferrin equivalent). The increase in both glycosylated and 

25 non-glycosylated transferrin titre observed in YEPD suggested that both 
expression transferrin plasmids were sufficiently stable under non-selective 
growth conditions to allow the expected increased biomass which usually 
results from growth in YEPD to be translated into increased glycosylated 
and non-glycosylated transferrin productivity- 
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SDS-PAGE analysis of non-glycosylated tratisfemn (N413Q, NG11Q) 
secreted from Steam A [pDB2536] and glycosylated transferrin from Strain 
A jpDB2506] grown in BMMD shake flask culture is shown in Figure 21. 
5 Strain A [pDB2536] samples clearly showed an additional protein band 
compared to the Strain A [pSAC35] control. This extra band migrated at 
the expected position for the recombinant transferrin (N413Q 5 N611Q) 
secreted from control strain [pDB253G]. Strain A [pDB2506] culture 
supernatants appeared tq confcam a diffuse protein band at the position 
10 expected fox transferrin. This suggested that the secreted recombinant 
transferrin was heterogeneous, possibly due to hyper-manosylation at 
Asp41 3 and/or Asp6 11. 

Example 4: Comparing transferrin secretion from & cerevisiae control 
15 strain containing PDB2711 with transferrin secretion from £ cerevisiae 
Strain A 

Plasmid pDB2711 1 is as described above. Plasmid pDB2712 (Figure 22) 
was also produced with the No A cassette in the opposite direction to 
20 • pDB2711. 

Control strain S. cerevisiae [cir°] was transformed to leucine prototropby 
with pDB2711 and pDB27l2. Transformants were selected on BMMD- 
agar and cryopreserved trehalose stocks of control strain [pDB271 1] were 
25 prepared (Table 3). - 
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Table 3: Recombinant transferrin titres from high call density 
fermentations 




Strain 


Supernatant CgX" 1 ) 


GPHPLC 


SDS-PAGE 


Control [pDB2536] 


0.5/0,4 




Alternative control [pDB2536] 


1.5/1.6 


0.6 


m 

OS/0.9 


0.4/0.4/0.5 


Strain A |pDB2536] 


0.7 


0.6 


0.6 






Control (pDB2711] 


3.5 


3.6 


3.4 


2.7/3.1 



5 Secretion of recombinant transferrin (N413Q, N611Q) by control strain 
[pDB27il], control strain [pDB27l2], Strain A [pDB2536] 5 control strain 
[pDB2536] and an alternative control strain [pE>B2536] was compared in 
both BMMD and YEPD shake flask culture. RIE indicated that a 
significant increase in recombinant transferrin secretion bad been achieved 

10 from control strain [pDB2711] with multiple episomal PDI1 copies, 
compared to Strain A [pDB2536] with two chromosomal copies of PDI1, 
* and control strain [pDB2536] with a single chromosomal copy of PDI1 
gene (Figure 23). Control strain [pDB2711] and control strain. [pDB2712] 
appeared to secrete similar levels of rTf (N413Q, N6UQ) into the culture 

15 media. The levels of secretion were relatively consistent between control 
strain [pDB2711] and control strain [pDB27l2] transformants in both 
BMMD and YEPD media, suggesting that plasmid stability was sufficient 
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for high-level transferrin secretion even under non-selective conditions. 
This is in contrast to the previous published data in relation to recombinant 
PDGF-BB and HSA where introduction of PDI1 into multicopy 2pm 

_ * 

plasmids was shown to be detrimental to the host. 



5 



Reducing SDS-PAGE analysis of transferrin secreted from control strain 
[pDB2711], control strain [pDB2712], Strain A [ P DB2536], control strain 
Q>DB2536] and alternative control strain [pDB2536] in BMMD shake flask 
culture is shown in Figure 24. This shows an abundant protein band in all 
10 samples from control strain [pDB271 1 ] and control strain £pDB27 1 2] at the 
position expected for transferrin (N413Q, N6HQ). The relative stain 
intensity of me transferrin (N413Q, N611Q) band from the different strains 
suggested that Strain A [pDB2S36] produced more than control strain 
[pDB2536] and alternative control strain [pDB2536] a but that there was an 
15 even more dramatic increase in secretion from control strain [pDB271 1] 
and control strain [pDB2712]. The increased recombinant transferrin 
secretion observed was concomitant with the increased PDI1 copy number 
in these strains. This suggested that Pdilp levels were limiting transferrin 
secretion in control strain, Strain A and the alternative control strain, and 
20 that elevated PDH copy number was responsible for increased transferrin 
secretion. Elevated FPU copy number could increase the steady state 
expression level a£PDI2 so increasing the amount of Pdilp activity. There 
are a number of alternative methods by which this could be achieved 
without increasing the copy number of the PDI1 gene, for example the 
25 steady state PDI1 mRNA level could be increased by either increasing the 
transcription rate, say by use of a higher efficiency promoter, or by reducing 
the clearance rate of the PDI1 mRNA. Alternatively, protein engineering 
could be used to enhance the specific activity or turnover number of the 
Fdilp protein. 
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In high cell density fermentations control strain [pDB27il] recombinant 
transferrin (N413Q, N611Q) production was measured at approximately 
3gX _1 by both GP-HPLC analysis and SDS-PAGE analysis (Table 3). This 
level of production is several fold-higher than control sixain, the alternative 
control strain or Strain A containing pDB253fr Furthermore, for the 
production of proteins for therapeutic use in humans, expression systems 
such as control strain [pDB271 1] have advantages over those using Strain 
A, as they do not contain bacterial DNA sequences. 

CONCLUSIONS 



Secretion of recombinant transferrin from a multicopy expression plasmid 
(pDB2536) is investigated in $.. cerevisiae strains containing an additional 
15 copy of the PDIJ gene integrated iuto the yeast genome. Transferrin 
secretion was also investigated from in S. cerevisiae transformed with a 
multicopy expression plasmid, in which the PDU gene has been inserted 
into the multicopy episomal tausferrin expression plasmid in which PDII 
is also inserted (pD3B271 1). 



A S. cerevisiae strain with an additional copy of the PDII gene integrated 
into the genome at the endogenous PDII locus, secreted recombinant 
transferrin and non-gLycosylated recombinant transferrin (N413Q, N6l 1Q) 
at an elevated level compared to strains containing a single copy of PDU. 
A further increase in PDII copy number was achieved by using pDB271 1 
In high cell density fermentation of the strain transformed with pDB2711, 
recombinant transferrin (N413Q, N611Q) was secreted at approximately 
3g.L" 1 , as measured by SDS-PAGB and GP-HPLC analysis. Therefore, 
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increased PDII gene copy number has produced a large increase in the 
quantity of recombinant transferrins secreted from S. cerevisiae. 

The following conclusions are drawn - 

5 

1, In shake flask analysis of recombinant transferrin expression from 
P DB2536 (non-glycosylated transferrin (N413Q, N611Q) and pDB2506 
(glycosylated transferrin.) the S. cerevisiae strain Strain A secreted higher 
levels of both recombinant transferrins into the culture supernatant than 

10 control strains. This was attributed to the extra copy of PDII integrated at 
the PDII locus. 

2. Control strain [pDE2711], which contained the PDII gene on the 
multicopy expression plasmid, produced a several-fold increase in 

is recombinant transferrin (N413Q, N611Q) secretion compared to Strain A 
[pDB2536] in both shake flask culture and high cell density fermentation. 

3. Elevated PDII copy number in yeast such as S. cerevisiae will be 
advantageous during the production of heterologous proteins, such as those 

20 from the transferrin family. 

4, pSAC35-based plasmids containing additional copies of PDII gene have 
advantages for the production of proteins from the transferrin family, and 
their derivatives, such as fusions, mutants, domains and truncated forms. 



25 
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CLAIMS 

1 . A method for producing heterologous protein comprising: 



5 (a) providing a host cell comprising a 2 fam- family plasmid, the plasmid 

comprising a gene encoding protein comprising the sequence of a 
chaperone protein and a gene encoding a heterologous protein; 

culturing the host cell in a culture medium under conditions that 
allow the expression of the gene encoding protein comprising the 
sequence of the chaperons protein and the gene encoding a 
heterologous protein; and 

( c ) purifying the thus expressed heterologous protein from the cultured 
15 host cell or the culture medium; and 

(d) optionally, lyophilising the thus purified protein. 

2. The method of Claim 1 further comprising the step of formulating 

20 the purified heterologous protein with a carrier or diluent and 

optionally presenting the thus formulated protein, iti a unit dosage 
form, 

3. Use of a 2pm-fainily plasmid as an expression vector to increase the 
25 production of a fungal (preferably yeast) or vertebrate heterologous 

protein by providing a gene encoding the heterologous protein and a 
gene encoding a chaperone protein on the same 2^m-family plasmid. 
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4. A 2|iTn-family plasmid comprising a gene encoding a protein 
comprising the sequence of a chaperone protein and a gene encoding 
a heterologous protein, wherein if the plasmid is based on the 2 pm 
plasmid then it is a disintegration vector. 



15 



20 



5< A method, use or plasmid according to any preceding claim wherein 
the chaperone has a sequence of a fungal chaperone (preferably a 
yeast chaperone) or a mammalian chaperone (preferably a human 
chaperone), 

10 

6. A method, use or plasmid according to any preceding claim wherein 

m 

the chaperone is selected from AHAl y CCT2, CCT3 3 CCT4, CCT5, 
CCT6, CCT7, CCT8, CNS1, CPR3, CPR6, EPSJ, EROl, EUG1, 
i - FMOl, HCH1, HSPW, HSP12 t HSP104, HSP26, HSP30, HSP42, 
HSP60, HSP78, HSP82, JEM J, MDJI, MDJ2, MPD1, MPD2, PDI1, 
PFD2 t ABCJ, APJJ, ATPU, ATP12, BTT1, CDC37, CPR7, HSC82, 
KAR2, LHS1, MGE1, MRSI1, NOB J, ECM20, SSA1, SSA2, SSA3, 
SSA4, SSC1, SSE2, S2Z1, SLS1, UBI4, ORM1, ORM2, PERI, PTC2, 
PSE1 and HACJ or truncated intronless HAC1. 



7. A method, use or plasmid according to any preceding claim wherein 
the chaperone is protein disulphide isomerase. 

8. A method, use or plasmid according to any preceding claim wherein 
25 the heterologous protein comprises a leader sequence" effective to 

cause secretion in yeast . 

9. A method, use or plasmid according to any preceding claim wherein 
the heterologous protein is a eukaryotic protein, or a fragment or 
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variant thereof* preferably a vertebrate or a fungal (preferably a 
yeast) protein. 



I Oh A method, use or plasmid according to any preceding claim wherein 
the heterologous protein is a commercially useful protein. 



11. A method, use or plasmid according to any preceding claim wherein 
the heterologous protein comprises a sequence selected from 
albumin* a monoclonal antibody, an etoposide, a serum protein (such 

to as a blood clotting factor), antistasm, a tick anticoagulant peptide, 

transferrin, lactoferrin 7 endostatin, angiostatin, collagens, 
immunoglobulins, Fab' fragments. F(ab , )2 jl scAb, scFv 5 interferons, 
interleukins 3 IL1Q, IL11., IL2, interferon a species and sub-species ? 
interferon P species and sub-species, interferon y species and sub- 

15 species, leptin, CNTF, CNTFax^ , IL1 -receptor antagonist, 

erythropoietin (EPO) and EPO mimics, thrombopoietm (TPO) and 
TFO mimics, prosaptide, cyanovirin-N, 5-helix, T20 peptide, T1249 
peptide, HJ.V gp41„ HIV gpl20, urokinase, prourokinase, tPA, 
hirudin, platelet derived growth factor, parathyroid hormone, 

20 promsulin, insulin, glucagon, glucagon-like peptides, insulin-like 

growth factor, calcitonin, growth hormone, transforming growth 
factor p, tumour necrosis factor, G-CSF, GM-CSP, M-CSF, FGF, 
coagulation factors in both pre and active forms, including but not 
limited Lo plasminogen, fibrinogen, thrombin, pre-thrombin, pto- 

25 thrombin, von Willebrand's factor, ai-antihypsin,, plasminogen 

activators, Factor VII, Factor VIII, Factor IX, Factor X and Factor 
XIIL nerve growth factor, LACI S platelet-derived endothelial cell 
growth factor (PD-ECGF), glucose oxidase, serum cholinestcrase, 
aprotinin, amyloid precursor protein, intcr-alpha trypsin inhibitor, 
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antithrombin III, apo-lipoproteiti species, Protein Q Protein S, or a 
variant or fragment of any of the above. 

12. A method, use or plasmid according to any preceding claim wherein 
the heterologous protein comprises the sequence of albumin or a 
variant or f ragment fhereof. 



13* A method, use or plasmid according to any preceding claim wherein 
the heterologous protein comprises the sequence of a transferrin 
10 family member, preferably transferrin or iactofenin, or a variant or 

fragment thereof. 



14, A method, use or plasmid according to any preceding claim wherein 
the heterologous protein comprises a fusion pmtein, sncfe as a fusion 
15 protein of albumin or a transferrin family member or a variant or 

fragment of either, fused directly or indirectly to the sequence of 
another protein. 



15, A host cell comprising a plasmid as defined by any preceding claim. 



20 



16. A host cell according to Claim 1 5 which is a yeast cell. 



17- A host celL according to Claim 1 6 in which the plasmid is based on 
pSRl, pSB3 or pSB4 and the yeast cell is Zygosaccharomyces 
25 rotfxiU the plasmid is based on pSBl or pSB2 and fhe-yeast cell is 

Zygosaccharomyces baffli, , the plasmid is based on pSMl and the 
yeast cell is Zygosaccharomyces fermentati* the plasmid is based on 
pKDl and the yeast cell is Kluyveromyces draxaphilorum, the 
plasmid is based on pPMl and the yeast cell is Pichia 
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m ernb ran aefaciens , or the plasmid is based on the 2(im plasmid and 
the yeast cell is Saccharomyces cerevisiae or Saccharomyces 
carlsbergensis* 



IS. A host cell according to Claim 17 in which the pjasrpid is based on 
the 2(Lim plasmid and the yeast cell is Saccharomyces cerevisiae or 
Saccharomyces cartsbergensis* 
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ABSTRACT 



MODIFIED PLASMID AND USE THEREOF 



5 The present invention provides a method for producing heterologous protein 
comprising: 

(a) providing a host cell comprising a 2 |um- family plasmid, the plasmid 
comprising a gene encoding a protein comprising the sequence of a 



(b) culturmg the host cell in a culture medium under conditions that 
allow the expression of the gene encoding the chaperone protein and 
the gene encoding a heterologous protein; and 

15 

(o) purifying the thus expressed heterologous protein from the cultured 
host cell or the culture medium; and 

(d) optionally, lyophilising the thus purified protein. 
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chaperone protein and a gene encoding a heterologous? protein; 
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Figure 1 
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Figure 9 

mHSA-pre = modified HSA-pre leader sequence 
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